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ABSTRACT 

A  3100  bp  cDNA  clone  (pcDW-12)  of  the  26S  region  of  western  equine  encephalitis  (WEE) 
virus  strain  7 IV- 1658  was  identified  by  dot  blot  hybridization  fi-om  a  cDNA  library.  The 
missing  5'  end  was  PCR-amplified  and  engineered  into  pcDW-12  to  obtained  a  full  length 
clone  of  the  26S  region.  The  resulting  construct  (XH-7)  was  restriction  mapped  and 
completely  sequenced  on  both  strands.  Only  eleven  of  the  sixty-three  nucleotide  differences 
resulted  in  amino  acid  changes  when  the  sequence  was  compared  to  WEE  strain  BPS- 1703 
(Hahn  et  al,  1988).  In  addition,  the  high  degree  of  conservation  of  the  structural  proteins  was 
maintained  when  compared  to  the  N-terminal  sequence  of  the  El  and  E2  proteins  of  the 
McMillan  strain  of  WEE  (Bell  et  al,  1983).  The  conserved  nature  of  the  structural  proteins 
would  indicate  that  one  strain  of  WEE  should  be  able  to  cross-protect  against  all  WEE  strains. 
A  2.2  kb  fragment  at  the  5'  end  of  the  genome  was  also  cloned  and  sequenced,  and 
demonstrated  high  homology  to  the  available  sequences  for  eastern  equine  encephalitis  (EEE) 
and  Highlands  J  (HJ)  viruses,  adding  further  evidence  that  the  entire  5'  nonstructural  region  of 
WEE  was  derived  firom  EEE.  In  summary,  the  genetic  cloning  and  sequencing  of  the  WEE 
26S  region  marks  the  critical  first  step  in  the  generation  of  a  subunit  vaccine  to  WEE. 
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INTRODUCTION 

Western  equine  encephalitis  virus  is  an  alphavirus  which  is  serologically  related  to  Sindbis 
(SIN),  HJ,  Fort  Morgan,  Buggy  Creek  and  Aura  viruses.  It  appears  to  be  a  recombinant  virus 
between  SIN  and  EEE  viruses  (Hahn  et  al,  1988).  WEE  is  endemic  in  western  North  America 
and  subtypes/varieties  have  been  isolated  from  Argentina  (AG80-646),  Brazil  (BeAr  102091) 
and  the  former  Soviet  Union  (Y 62-33).  In  nature,  WEE  is  transmitted  to  man  and  horses  by 
mosquitoes  {Culex  tarsalis  is  the  principal  vector)  from  its  amplifying  host  or  reservoir  of  wild 
birds.  Infection  of  man  from  the  endemic  cycle  has  resulted  in  only  a  limited  number  of 
infections  in  recent  years.  However,  in  the  past,  major  epidemics  of  WEE  have  been  recorded. 
The  most  extensive  epidemic  occurred  in  the  western  United  States  and  Canada  in  1941,  with 
3,336  recognized  human  cases  and  300,000  cases  of  encephalitis  in  horses  and  mules  (Peters 
and  Dalrymple,  1990). 

The  incubation  period  for  WEE  is  between  5-10  days.  As  described  by  Peters  and  Dalrymple, 
1990,  the  infection  varies  from  mild  fever  with  headache  to  aseptic  meningitis  and  encephalitis. 
The  disease  usually  begins  with  the  sudden  onset  of  fever,  malaise,  headache,  nausea, 
vomiting,  and  occasionally,  respiratory  symptoms.  Vertigo,  photophobia,  lethargy,  abdominal 
pain  and  myalgia  are  common  symptoms.  After  a  few  days  the  headache  becomes  more 
intense,  and  in  severe  cases,  disorientation,  smpor,  tremors,  paralysis  or  coma  may  develop. 
Fatal  cases  usually  succumb  during  the  first  week.  Infants  and  children  are  more  susceptible, 
and  the  onset  may  be  more  abrupt,  often  associated  with  sei2mres  and  convulsions. 
Pathological  reflexes  vary,  but  commonly  include  weakness  and  hypoflexia,  and  children  often 
display  nuchal  rigidity,  disturbed  sensorium,  involuntary  movements  and  paralysis.  After  10 
days,  patients  usually  begin  a  gradual  convalescence.  Most  adults  will  recover  fully,  although 
infants  and  children  are  at  higher  risk  of  developing  permanent  damage  (Peters  and  Dalrymple, 
1990;  Home  eta/,  1991). 

The  pathogenicity  of  WEE  resembles  that  of  EEE,  although  it  is  less  neuroinvasive  and 
neurovimlent  in  both  humans  and  laboratory  animals.  The  primary  pathological  findings  are 
located  in  the  central  nervous  system  and  include  multiple  foci  of  necrosis.  The  case  infectivity 
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ratio  has  been  estimated  to  be  around  1 : 1 1 50  in  adults  and  drops  to  1 :58  in  children  and  1 : 1  in 
infants.  The  case  fatality  rate  is  between  3-8%,  although  the  infectious  dose  is  not  known 
(Peters  and  Dalrymple,  1990).  Of  note,  out  of  seven  reported  cases  of  laboratory-associated/ 
acquired  infections  with  WEE  (associated  with  egg  cultures,  suckling  mice  and  aerosols  from 
lyophilized  material),  two  deaths  were  recorded  (Office  of  Biosafety,  Laboratory  Centre  for 
Disease  Control  -  LCDC,  Canada).  No  direct  human  to  human  transmission  of  the  virus  has 
been  recorded  and  the  virus  is  not  demonstrated  in  blood  of  man  after  the  onset  of  disease 
(Peters  and  Dalrymple,  1990).  Highlands  J  virus  is  antigenically  related  to  WEE,  and  was 
previously  referred  to  as  the  eastern  variant  of  WEE.  It  does  not  cause  clinical  disease  in 
humans  or  horses,  although  a  case  of  encephalitis  in  a  horse  was  attributed  to  HJ  (Hayes  and 
Wallis,  1977;  Karabatsos  etal,  1988). 

Western  equine  encephalitis  virus  is  an  enveloped  virus,  containing  a  nucleocapsid  (NC)  which 
encases  a  single  strand  of  positive-sense  genomic  RNA.  The  RNA  is  estimated  to  be  near  12 
kb  in  length,  based  on  the  size  of  the  SIN  virus  genome  (Strauss  and  Strauss,  1994).  The 
nonstructural  proteins  are  translated  directly  from  the  5'  two-thirds  of  the  genomic  RNA,  while 
the  structural  proteins  are  translated  from  a  26S  subgenomic  mRNA  encoded  in  the  3'  end.  The 
26S  region  of  WEE  strain  BPS  1703  has  been  cloned  and  sequenced  (Hahn  et  al,  1988).  The 
structural  proteins  (NC,  E3,  E2, 6K  and  El)  are  processed  by  proteolytic  cleavage  from  a  single 
polyprotein.  Apart  from  the  NC,  the  other  proteins  are  posttranslationally  processed  in  the 
endoplasmic  reticulum  including  glycosylation  of  the  El  and  E2  proteins  and  the  addition  of 
lipid.  Extrapolating  from  research  with  SIN  and  Venezuelan  equine  encephalitis  (VEE) 
viruses,  neutralizing  domains  were  mapped  primarily  to  the  E2  protein,  and  to  a  lesser  extent, 
to  the  El  protein  (reviewed  in  Roehrig,  1986). 

An  inactivated  WEE  vaccine  is  in  Investigational  New  Drug  (IND)  status  and  requires  a 
minimum  of  three  injections  as  well  as  yearly  boosters.  Its  major  problem  is  the  relatively  poor 
and  short-term  immunogenicity  to  WEE  that  is  produced.  Currently,  no  antiviral  to  WEE  has 
shown  potential  for  treatment  of  infections,  although  a  synthetic  double  stranded  RNA,  poly- 
ICLC,  will  be  evaluated  at  DRES.  In  summary,  a  better  vaccine  needs  to  be  developed  for 
WEE.  At  the  US  Army  Medical  Research  Institute  Infectious  Diseases(USAMRIID),  research 
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is  progressing  towards  the  production  of  a  live  attenuated  WEE  vaccine  via  mutagenesis  of  an 
infectious  clone  of  WEE  virus.  Live  attenuated  vaccines  have  proven  to  provide  long-term 
immtmity,  and  in  many  cases,  protection  is  life-long.  However,  new  types  of  problems  can 
arise  with  these  vaccines.  The  live  attenuated  VEE  vaccine  (TC83)  has  had  trouble  with 
reversion  of  the  virus  to  a  more  virulent  form  (occurred  in  up  to  20  %  of  the  recipients  of  the 
vaccine  -  personal  communication,  J.  Smith,  USAMRIID).  A  second  potential  problem  to  be 
addressed  is  whether  a  live  attenuated  WEE  virus  will  interfere  with  live  attenuated  VEE  or 
EEE  components  of  a  trivalent  vaccine.  As  an  alternative,  USAMRIID  is  looking  at  a 
baculovirus-produced  subunit  vaccine  and  microencapsulation  for  enhanced  delivery  of 
inactivated  vaccine. 

At  DRES,  research  has  focused  on  the  development  of  subunit  vaccines  to  WEE.  Subunit 
vaccines  involve  inoculation  of  individuals  with  the  protective  component  of  the  virus 
(envelope  proteins).  Subimit  vaccines  are  generally  safer  than  live  vaccines,  causing  fewer 
complications  during  vaccination  and  no  risk  of  reversion.  They  can  be  produced  in  a  variety 
of  expression  systems  (i.e.  yeast,  vaccinia,  or  as  plasmid  DNA),  eliminating  the  need  for  high 
level  biocontainment  production  facilities.  Although  multiple  doses  are  usually  required, 
recent  advances  in  microencapsulation  and  DNA  immunization  have  allowed  the 
administration  of  a  single  dose  which  can  induce  a  protective  immime  response.  Liposomal- 
encapsulation,  in  conjunction  with  aerosol  delivery,  could  prove  effective  in  enhancing  the 
immune  response  (Wong  et  al,  1992;  Saravolac  et  al,  1996).  The  first  step  in  the  construction 
of  a  recombinant  vaccine  is  to  clone  the  relevant  sequences  firom  the  viral  genome.  In  this 
paper,  we  describe  the  steps  in  the  cloning  and  DNA  sequencing  of  the  26S  region  of  WEE 
virus  strain  71 V-1658. 

MATERIALS  AND  METHODS 

CHEMICALS  AND  REAGENTS 

Chemicals  and  reagents  were  purchased  from  Sigma  Chemicals  (St.  Louis,  MO),  Aldrich  (St. 
Louis,  MO)  and  Fisher  Scientific  (Edmonton,  AB).  Restriction  and  DNA  modification 
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enzymes  were  purchased  from  Gibco/BRL  (Burlington,  ON),  Pharmacia  (Baie  d’Urfe,  PQ)  or 
New  England  Biolabs  (Beverly,  MA).  Other  suppliers  include:  GeneAmp™  polymerase 
chain  reaction  (PCR)  reagents  purchased  from  Perkin  Elmer  (Foster  City,  CA);  Multiprime™ 
random  primer  labelling  kits  and  radioisotopes  from  Amersham  (Oakville,  ON);  avian 
myeloblastosis  virus  (AMV)  reverse  transcriptase  from  Life  Sciences  (Petersburg,  FL);  tissue 
culture  media,  sera.  E-link™  kit  and  Photogene™  reagent  from  Gibco/BRL  (Burlington, 
ON);  and  bacteriological  media  (Difco,  Detroit,  MI;  Gibco/BRL,  Burlington,  ON).  DNA 
primers  were  synthesized  and  gel  purified  at  the  Regional  DNA  Synthesis  Laboratory 
(Calgary,  AB)  based  on  the  published  WEE  26S  sequence  (Hahn  et  al,  1988). 

VIRUS  CULTURE  AND  PURIFICATION 

Tissue  culture  was  maintained  in  accordance  to  established  practices  (Bird  and  Forrester, 
1981).  BHK-21  and  Vero  cells  were  obtained  from  American  Type  Culture  Collections 
(ATCC  -  Rockville,  MD)  and  USAMRIID.  All  manipulations  of  WEE  were  performed 
following  Biocontainment  Laboratory  level  3  (BL-3)  requirements  at  USAMRIID.  A  10% 
suckling  mouse  brain  (SMB)  suspension  of  WEE  strain  7 IV- 1658  was  kindly  provided  by  Dr. 
Nick  Karabatsos,  Centers  for  Disease  Control  (CDC),  Fort  Collins,  CO.  A  seed  stock  of  71V- 
1658  was  made  by  inoculation  of  4  roller  bottles  of  Vero  19  cells  (USAMRIID)  with  70  pi  total 
volume  of  the  SMB  suspension  at  a  multiplicity  of  infection  (MOI)  of  less  than  0. 1 .  Each  roller 
bottle  of  cells  (90-95%  confluent)  was  incubated  with  the  virus  in  10  ml  of  Earles  minimal 
essential  media  (EMEM)  supplemented  with  10  mM  HEPES  buffer,  2  mM  L-glutamine,  100 
U/ml  penicillin,  100  pg/ml  streptomycin,  50  pg/ml  gentamycin  containing  5%  fetal  calf  serum 
(FCS)  -  5%  EMEM.  Incubation  was  continued  for  1  hr  at  37°C  and  an  additional  40  ml  of  5% 
EMEM  was  added  to  each  roller  bottle.  The  cytopathic  effect  (CPE)  was  allowed  to  proceed 
to  80-85%  (approximately  3  days).  The  lysate  was  harvested,  debris  removed  by  centrifugation 
at  10,000  X  g,  and  the  supernatant  aliquotted  and  stored  at  -70°C.  Virus  stocks  were  prepared 
by  infecting  20  roller  bottles  with  the  WEE  seed  stock  at  a  MOI  of  10.  After  cell  lysis  (80-85% 
CPE),  the  virus  was  precipitated  from  cleared  supernatant  by  the  addition  of  polyethylene 
glycol  (PEG)  MW  6000  to  7%(w/v)  and  NaCl  to  2.3%(w/v).  The  virus  suspension  was  gently 
stirred  for  15-20  hr  at  4°C,  followed  by  centrifugation  at  10,000  x  g  for  30  min  at  room 
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temperature  (RT).  The  pellets  were  resuspended  in  a  total  volume  of  10  ml  of  phosphate- 
buffered  saline  (PBS  -  0.01  M  sodium  phosphate  buffer,  27  mM  KCl,  0.137  M  NaCl,  pH7.4), 
and  5  ml  of  virus  suspension  was  carefully  layered  onto  30  ml  of  a  20-60%(w/w)  continuous 
sucrose  gradient  in  a  SW  28  tube  (Beckman,  Mississauga,  ON).  The  tube  was  centrifuged  at 
100,000  X  g  (27,500  RPM  in  SW  28  rotor)  for  3.5  hr  at  4°C.  The  virus  was  harvested  by 
removal  of  the  virus  band  with  a  pasteur  pipette.  The  virus  suspension  was  then  diluted  at  least 
1:3  with  Hanks  Buffered  Saline  Solution  (HBSS  -  Bird  and  Forrester,  1981),  and  centrifuged 
at  100,000  X  g  for  1  hr  at  4°C.  The  virus  pellet  was  resuspended  in  HBSS  or  PBS  (5-10  ml), 
aliquotted  and  stored  at  -70°C. 

ISOLATION  OF  RNA 

Viral  RNA  used  in  library  constructions  to  HJ  strain  B320  and  WEE  strain  71V-1658  were  a 
kind  gift  of  Dr  Dennis  Trent,  CDC  (Fort  Collins,  CO).  Subsequent  batches  of  RNA  were 
prepared  from  WEE  virus  strain  71V-1658  cultured  on  Vero  or  BHK-21  cells.  The  virus  was 
lysed  by  incubation  in  0.5  %  sodium  dodecyl  sulfate  (SDS)  and  RNA  extracted  using  the 
cesium  chloride  (CsCl)/guanidium  isothiocyanate  method  as  described  (Sambrook  et  al,  1989), 
with  the  RNA  being  pelleted  through  a  5.7  M  CsCl  cushion.  The  RNA  was  aliquotted, 
precipitated  by  the  addition  of  1/10  volume  of  3  M  sodium  acetate  and  2  volumes  of  ethanol 
(International  Biotechnologies  Inc.  -  IBI,  New  Haven,  CT),  then  stored  at  -70°C.  Prior  to  use, 
the  RNA  was  pelleted  15  min  at  11,600  xg  in  a  microfuge  at  4°C,  washed  with  80%(v/v) 
ethanol  and  dried  in  a  Savant  Speed-Vac  AES  1000  (Savant  Instruments,  Hicksville,  NY).  The 
pellet  was  dissolved  in  nuclease-free  water  (Promega,  Madison,  WI;  Gibco/BRL,  Burlington, 
ON),  and  alliquots  were  stored  at  -70°C.  Integrity  of  the  RNA  was  checked  on  formaldehyde 
agarose  gels  as  described  by  Sambrook  et  al,  1989.  Aerosol-free,  RNase-free  pipette  tips 
(Gordon  Technologies,  Mississauga,  ON)  siliconized  RNase-free  microfuge  tubes,  use  of 
diethylpyrocarbonate  (DEPC)-treated  dH20  for  buffers  and  aseptic  techniques  were  used  for 
all  manipulations  and  reactions.  Gel  boats  were  cleaned  with  70%(v/v)  ethanol  and  DEPC- 
treated  water  immediately  before  use. 
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cDNA  LIBRARY  CONSTRUCTION 

Libraries  to  HJ  strain  B320  or  WEE  strain  71 V-1658  were  made  by  Invitrogen  (San  Diego,  CA) 
using  the  following  protocol:  an  oligo-dT  primer  was  used  for  reverse  transcription  of  5  pg  of 
viral  RNA  to  cDNA.  After  synthesis  of  the  second  DNA  strand,  linkers  were  added  and  the 
dsDNA  was  size  selected  to  greater  than  2,000  bp.  The  sized,  electroeluted  DNA  was  ligated 
into  the  BstXI  site  of  prepared  pcDNAII  vector,  and  electroporated  into  electrocompetent 
DHlaF’  Escherichia  coli  cells.  The  cells  were  amplified  by  growing  on  LB  agar  containing 
50-100  pg/ml  ampicillin. 

MANIPULATION  OF  DNA 

Well  publicized  methods  were  used  for  handling  and  manipulation  of  DNA  (Sambrook  et  al, 
1989;  Ausubel  et  al,  1995).  These  include  the  use  of  aseptic  technique  for  the  handling  bacteria 
and  tissue  culture.  Where  possible,  high  grade  reagents,  nuclease-firee  water  (Promega, 
Madison,  WI),  siliconized  microfuge  tubes,  sterile  solutions  and  disposables  were  used. 
Phenol/chloroform  extractions  used  an  equal  volume  of  a  mixture  of  water-saturated  distilled 
phenol  (Gibco/BRL,  Burlington,  ON),  chloroform  and  isoamyl  alcohol  in  a  ratio  of  25:24:1 
ratio.  This  was  followed  by  ethanol  precipitation  using  1/10  to  1/20  volume  sodium  acetate  (3 
M)  and  2-2.5  volumes  molecular  biology  grade  ethanol  (IBI,  New  Haven,  CT).  The  DNA  could 
be  stored  indefinitely  at  -20°C  or  pelleted  10  min  at  11,600  x  g,  washed  once  with  80%(v/v) 
ethanol  and  dried  in  a  Speed- Vac™  AES  1000  centrifugal  vacuum  drier  (Savant  Instruments, 
Hicksville,  NY).  Nucleic  acid  was  quantitated  by  measurement  of  the  absorbance  of  an  aliquot 
of  the  sample  at  260  nm  using  a  Unicam  8730  spectrophotometer  (Mississauga,  ON). 

RESTRICTION  ANALYSIS  OF  DNA 

Restriction  digests  were  performed  by  dilution  of  DNA  (up  to  5  pg)  in  dH20  and  with  the 
addition  of  1  pi  RNase  (10  mg/ml,  heat- inactivated  by  boiling  20  min),  2  pi  restriction 
endonuclease  buffer  (lOx)  in  a  total  volume  of  20  pi.  Up  to  2  pi  of  restriction  endonuclease(s) 
were  added  (1-5  U/pg  DNA)  and  the  reaction  allowed  to  incubate  at  the  specified  temperature 
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(usually  37°C).  For  restriction  mapping,  digestions  were  allowed  to  proceed  for  2-3  hrs,  while 
for  DNA  fragment  isolation,  the  digests  were  incubated  for  exactly  1  hr.  Three  pi  of  lOx  gel 
loading  dye  [50%(v/v)  glycerol,  25%(w/v)  ficoll,  0.25%  (w/v)  bromphenol  blue,  0.25%(w/v) 
xylene  cyanol  FF]  was  added,  and  the  mixture  loaded  onto  an  agarose  gel  for  electrophoresis. 
The  gels  were  cast  in  removable  plastic  trays  (1 1 X  14  cm)  to  a  thickness  of  3-5  mm,  using  100 
ml  of  a  1.0-1.5%(w/v)  agarose  (Gibco/BRL,  Burlington,  ON;  Seakem™  or  Seaplaque™  - 
FMC  Bioproducts,  Rockland,  ME)  solution  made  up  in  gel  running  buffer.  The  gel  buffers 
used  were  lx  Tris  borate  EDTA  (TBE)  (90  mM  Tris-base,  90  mM  boric  acid  and  2  mM  EDTA) 
or  lx  Tris-Acetate  EDTA  (TAE)  (40  mM  Tris-base,  20  mM  sodium  acetate,  0.2  mM  EDTA, 
adjusted  to  pH  7.4  with  glacial  acetic  acid).  The  agarose  was  dissolved  by  microwave  heating 
of  the  agarose/buffer  suspension  (3-4  min  on  50%  power),  and  the  solution  cooled  to  40-50°C 
before  5  pi  of  10  mg/ml  ethidium  bromide  (EtBr)  was  added.  The  TBE  gels  were  generally  run 
at  80-120  V  and  the  TAE  gels  run  at  60-80  V  in  the  appropriate  lx  gel  buffer  (Sambrook  et  al, 
1989;  Ausubel  et  al,  1995). 

LABELLING  OF  GENE  PROBES  TO  HJ  AND  WEE 

For  the  screening  of  the  HJ  library,  the  HJ-3'  and  WEEPRO  oligonucleotides  were  labelled 
using  the  E-link  kit  (Gibco/BRL,  Burlington,  ON)  following  the  manufacturer’s  instructions. 
The  oligonucleotide  probes  were  also  5'  end-labelled  using  y-[^^P]ATP  (Amersham,  Oakville, 
ON)  and  polynucleotide  kinase  as  described  in  Sambrook  et  al  (1989).  For  WEE  library 
screening,  DNA  fragments  were  generated  using  standard  PCR  reaction  conditions  (10  mM 
Tris-HCl,  pH  8.3,  50  mM  KCl,  2.0  mM  MgCl2,  200  pM  of  each  dNTP,  2.5  U  Amplitaq™, 
1.0  pM  of  each  primer)  as  described  by  Perkin  Elmer  (Foster  City,  CA).  Optionally,  1  U  of 
Perfect  Match™  (Stratagene,  La  Jolla,  CA)  could  be  added  to  the  reaction  mixture.  Fifty  pi 
reactions  were  overlaid  with  mineral  oil  or  with  PCR  Gems™  in  500  pi  microfuge  tubes.  The 
reaction  program  consisted  of  25  cycles  of  1  min  at  94°C,  2  min  at  37°C  and  3  min  at  72°C, 
followed  by  a  final  7  min  extension  at  72°C  in  a  Perkin  Elmer  DNA  thermocycler  (Foster  City, 
CA).  The  primer  pairs  used  to  generate  PCR  fragments  in  the  26S  region  of  WEE  were 
WEEPROAVEEPO,  WEEP5AVEEN5,  WEEP7AVEEN7  and  WEE3'/WEEN0  (see  Table  1). 
The  reaction  mix  was  loaded  onto  a  l%(w/v)  TAE  agarose  gel  containing  0.5  pg/ml  EtBr. 
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DEAE  81  paper  (Whatman,  Hillsboro,  OR)  was  used  to  trap  the  band  of  interest  by  placing  an 
incision  ahead  of  the  DNA  band  of  interest.  A  strip  of  DEAE  8 1  paper  was  inserted  into  the  slot 
and  electrophoresis  continued  for  15-30  min.  The  DEAE  8 1  paper  containing  the  trapped  DNA 
was  then  washed  with  low-salt  wash  buffer  (10  mM  Tris-HCl,  pH  8.0,  0.10  M  NaCl,  1  mM 
EDTA),  followed  by  elution  of  the  DNA  with  50-100  pi  of  elution  buffer  (10  mM  Tris-HCl, 
pH  8.0,  1.5  M  NaCl,  1  mM  EDTA).  The  eluant  was  phenol/CHCl3  extracted,  followed  by 
ethanol  precipitation  (Danner,  1982;  Sambrook  et  al,  1989).  Alternatively,  the  fragment  was 
isolated  from  low  melting  agarose  gels  by  excision  of  the  fragment,  and  subsequent  phenol 
extraction  of  the  melted  agarose  (Sambrook  et  al,  1989;  Ausubel  et  al,  1995). 

After  washing  and  drying,  the  PCR  fragment  was  labelled  using  the  Multiprime™  random 
primer  labelling  kit  (Amersham,  Oakville,  ON).  Briefly,  25  ng  of  DNA  in  10  pi  of  dH20  was 
boiled  2  min  and  quickly  cooled  on  ice.  The  following  items  were  added:  1 1  pi  dH20;  4  pi  of 
each  of  dATP,  dGTP  and  TTP;  5  pi  buffer  concentrate;  5  pi  primer/BSA;  5  pi  a[^^P]-dCTP  in 
tricene  (3,000  Ci/mmole,  50  pCi  -  Amersham,  Oakville,  ON);  and  2  pi  of  DNA  polymerase  I 
Klenow  fragment  (lU/pl).  The  reaction  was  incubated  30-60  min  at  37°C,  after  which  50  pi 
of  TE  buffer  (10  mM  Tris-HCl,  pH  8.0,  1  mM  EDTA)  was  added.  The  free  nucleotides 
removed  by  passing  the  sample  through  a  1.0  ml  G-50  Sephadex™  (Pharmacia,  Baie  d’Urfe, 
PQ)  spim-column  as  described  (Sambrook  et  al,  1989)  or  through  the  use  of  Nuctrap™  Push 
Columns  (Stratagene,  La  Jolla,  CA)  following  the  manufacturer’s  instructions. 

PREPARATION  OF  COLONY  AND  DOT  BLOTS 

Colony  hybridization  was  based  on  the  original  procedure  of  Grunstein  and  Hogness  (1975)  as 
described  in  Sambrook  et  al  (1989)  to  liberate  the  DNA  from  bacterial  colonies  and  bind  it  to 
nitrocellulose  filters  (Hybond-C  Extra  -  Amersham,  Oakville,  ON).  Pieces  of  Whatman  3MM 
paper  were  cut  to  an  appropriate  size  and  shape  to  fit  into  the  bottoms  of  three  glass  trays.  The 
filter  paper  was  saturated  with  one  of  the  following  solutions:  10%  SDS;  denaturing  solution 
(0.5  N  NaOH,  1.5  M  NaCl);  or  neutralizing  solution  (1.5  M  NaCl,  0.5  M  Tris  Cl,  pH  7.4). 
Bacteria  to  be  screened  were  plated  out  onto  appropriate  media  (LB  agar  with  50-100  pg/ml 
ampicillin)  and  grown  overnight  (0/N).  The  nitrocellulose  filters  were  placed  on  top  of  the  1- 
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2  mm  colonies  in  the  90  or  150  mm  plates  and  left  30-60  sec.  The  position  of  the  filter  relative 
to  the  plate  was  indicated  by  using  a  18  or  20  g  needle  to  puncture  holes  through  the  filter  and 
into  the  plate.  These  holes  could  then  be  used  to  re-align  the  plate  to  an  autoradiograph  of  the 
filter  after  hybridization,  allowing  positive  colonies  to  be  selected  from  the  plate.  The  filters 
were  lifted  carefully  from  the  plate  and  placed  colony  side  up  on  the  SDS  impregnated  3MM 
paper.  After  the  first  filter  had  been  exposed  to  the  SDS  solution  for  5  minutes,  it  was 
transfered  to  the  second  sheet  of  3MM  paper,  saturated  with  denaturing  solution  for  5  minutes. 
The  filters  were  then  transferred  to  the  third  sheet  of  3MM  paper,  saturated  with  neutralizing 
solution,  and  left  for  5  minutes.  They  were  then  placed  on  a  clean  sheet  of  Whatman  3MM 
paper  (colony  side  up)  and  the  surface  of  the  filter  was  blotted  once  with  a  second  sheet  of  3 
MM  paper.  The  filters  were  transferred  to  a  tray  containing  2x  SSC  (0.3  M  sodium  chloride 
and  0.03  M  sodium  citrate,  pH  7.0)  and  gently  agitated  for  5  min.  They  were  dried  at  RT  for  at 
least  30  minutes,  sandwiched  between  two  sheets  of  3MM  paper  and  clamped  between  glass 
plates.  They  were  baked  for  1-2  hours  at  80°C  in  a  vacuum  oven.  Any  filters  not  used 
immediately  in  hybridization  reactions  were  sealed  in  plastic  and  stored  at  -70°C. 

Dot  blots  were  prepared  for  more  accurate  screening  of  recombinant  plasmids  with  labelled 
probes  (Kafatos  et  al,  1979).  Nylon  membranes  (Hybond-N™  -  Amersham,  Oakville,  ON) 
were  spotted  with  1-2  pi  of  plasmid  preparations  along  with  appropriate  controls.  After  air 
drying,  the  membranes  were  covered  with  plastic  wrap  and  placed  face  down  on  a  Fotodyne 
Model  3-3500  UV  transilluminator  (New  Berlin,  WI)  for  5  min  on  the  long  UV  setting.  The 
membranes  were  denatured  similarly  as  for  colony  filters,  using  3  min  in  denaturing  solution, 

3  min  in  neutralizing  solution  and  equilibrated  for  5  min  in  2x  SSC  (Grunstein  and  Hogness, 
1975;  Sambrook  etal,  1989). 

HYBRIDIZATION  TO  LABELLED  DNA  PROBES 

Hybridization  and  washing  with  the  HJ3'  and  WEEPRO  E-link  oligonucleotide  probes 
followed  the  manufacturer’s  directions.  For  [^^P]-labelled  probe  screening,  nitrocellulose 
filters  and  nylon  membranes  were  hybridized  utilizing  formamide  solutions  (Denhardt,  1966, 
McConaughy  et  al,  1969).  The  membranes  were  prehybridized  for  1  hr  at  42°C  in  sealed 
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plastic  bags  containing  10-20  ml  Hood  buffer  [50%(v/v)  formamide  (deionized);  20  mM 
sodium  phosphate  buffer  (pH  6.7);  5x  SSC  (0.75  M  sodium  chloride,  0.075  M  sodium  citrate, 
pH  7.0);  7%(w/v)  SDS;  1  %(w/v)  PEG  MW  20,000;  and  0.5%(w/v)  nonfat  powdered  milk 
(Carnation)].  For  large  numbers  of  filters,  teflon  mesh  was  used  to  separate  the  layers  of  filters. 
Hybridization  was  carried  out  in  5-10  ml  Hood  buffer  containing  10^  cpm  of  radiolabelled 
probe  (Sambrook  et  al,  1989).  After  hybridization,  filters  or  membranes  were  transferred  to 
polyethylene  trays  and  washed  with  two  quick  changes  of  2x  SSC,  l%(w/v)  SDS.  They  were 
then  washed  with  2  changes  of  2x  SSC,  l%(w/v)  SDS  at  42-60°C,  20  min  each.  A  final  wash 
of  20  min  at  RT  in  0.  lx  SSC,  l%(w/v)  SDS  with  gentle  agitation  was  performed,  and  the  filters 
or  membranes  were  sealed  in  plastic  for  exposure  with  Kodak  XAR-5  X-ray  film  (Rochester, 
NY). 

PREPARATION  OF  PLASMID  DNA 

A  scaled-up  version  of  the  Holmes  and  Quigley  (1981)  procedure  for  rapid  preparation  of 
bacterial  plasmids  was  followed.  Clones  identified  by  colony  hybridization  were  cultured  O/ 
N  in  2-5  ml  of  LB  broth  (10  g  tryptone,  5  g  yeast  extract,  5  g  NaCl  in  1.0  liter  dH20)  with  50- 
100  pg/ml  ampicillin.  The  cells  were  pelleted  for  30  min  at  3,000  RPM  (Beckman  TJ-6  bench 
top  centrifuge  -  Mississauga,  ON)  and  the  supernatant  discarded.  The  cell  pellet  was 
resuspended  in  200  pi  of  STET  buffer  [8%(w/v)  sucrose,  5%  triton  X-100,  50  mM  EDTA  and 
50  mM  Tris-HCl,  pH  8.0]  and  the  suspension  transferred  to  a  1.5  ml  microfuge  tube.  Twenty- 
five  pi  of  freshly  prepared  lysozyme  (10  mg/ml)  was  added  and  the  samples  boiled  1  min.  The 
tubes  were  then  spun  for  15  min  at  1 1,600  x  g  in  a  microfuge  at  RT.  The  supernatant  was 
transferred  to  a  clean  1.5  ml  microfuge  tube,  phenol-chloroform  extracted,  and  precipitated 
with  an  equal  volume  of  isopropanol  (Sambrook  et  al,  1989).  The  tube  was  placed  at  -20°C  for 
a  minimum  of  30  min,  and  then  pelleted  at  11,600  x  g.  The  pellet  was  washed,  dried  and 
dissolved  in  100-200  pi  sterile  dH20. 

Large  batches  ofE.  coli  plasmid  were  isolated  using  the  Qiagen  kit  (Chats worth,  CA),  a  method 
based  on  the  alkali  lysis  method  of  Bimboim  and  Doly  (1979),  following  the  manufacturer’s 
instructions.  A  second  method  used  for  large-scale  plasmid  preparation  was  by  a  procedure 
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involving  lysis  with  triton  X-100  (Davis  et  al,  1980).  Cells  were  grown  0/N  in  LB  broth 
containing  50-100  ^g/ml  ampicillin  (0.5-1. 0  liter),  then  pelleted  for  10  min  at  10,000  x  g 
(Beckman  JA-14  rotor  -  Mississauga,  Ont.).  The  pellet  was  washed  in  TES  (50  mM  Tris-HCl, 
pH8.0,  50  mM  NaCl  and  5  mM  EDTA)  and  then  resuspended  in  1.5  ml  of  cold  sucrose  buffer 
(25%  sucrose,  50  mM  Tris-HCl,  pH8.0  and  1  mM  EDTA).  After  10  min  on  ice,  0.15  ml  of  10 
mg/ml  lysozyme  (fresh,  in  dH20)  was  added  and  the  cells  were  left  for  an  additional  10  min 
on  ice.  Next,  0.6  ml  of  0.25  M  EDTA  was  added,  and  the  cells  were  incubated  for  a  further  10 
min  on  ice.  Slowly,  5.25  ml  of  triton  lysis  buffer  [2%(v/v)  triton  X-100,  50  mM  Tris-HCl,  pH 
8.0  and  60  mM  EDTA]  was  added  with  gentle  stirring,  and  the  mixture  left  on  ice  for  15-20 
min  Cell  debris  was  removed  by  centrifugation  for  25  min  at  37,000  x  g  (Beckman  J2-21, 
JA20  rotor  -  Mississauga,  ON)  after  which  the  volume  of  the  supernatant  was  adjusted  to  9  ml 
with  TE  buffer.  Nine  grams  of  molecular  biology  grade  CsCl  (Gibco/BRL,  Burlington,  ON) 
was  dissolved  in  each  sample  followed  by  the  addition  of  0.1  ml  EtBr  (10  mg/ml  in  dimethyl 
sulfoxide).  The  solution  was  transferred  to  Beckman  heat-seal  tubes,  filled  with  paraffin  oil, 
sealed  and  centrifuged  for  18-24  hrs  at  247,000  x  g  (Beckman  L8-70,  70.1  Ti  rotor  - 
Mississauga,  ON).  The  plasmid  band  was  collected  by  puncture  of  the  tube  using  a  1 8  g  needle 
and  3  ml  syringe  was  used  to  withdraw  the  band.  The  DNA  was  extracted  with  dH20-saturated 
N-butanol  (Sambrook  et  al,  1989)  and  dialyzed  against  TE  buffer. 

DNA  SEQUENCE  ANALYSIS 

Manual  sequencing  was  performed  using  the  Sequenase™  ver  2.0  reaction  kit  (Amersham, 
Oakville,  ON).  The  plasmid  template  (i.e.  pcDW-6,  pcDW-12,  XH-7)  was  previously  purified 
using  one  of  the  large-scale  plasmid  purification  methods.  A  1  pg/pl  plasmid  stock  in  dH20 
was  required.  Five  pg  of  plasmid  template  was  denatured  by  the  addition  of  1/10  volume  2  N 
NaOH  and  incubated  30  min  at  37°C.  This  was  followed  by  neutralization  with  1/10  volume 
of  3  M  sodium  acetate.  The  sample  was  then  precipitated  by  the  addition  of  4  volumes  of 
95%(v/v)  ethanol,  left  0/N  at  -20°C,  pelleted,  washed  and  resuspended  in  dH20.  For  each 
sequencing  reaction,  5  pg  of  denatured  template  in  7  pi  dH20  was  mixed  with  1  pi  of  primer 
(0.5  pmol)  and  2  pi  of  Sequenase™  reaction  buffer.  The  mixture  was  incubated  for  2  min  at 
65®C,  then  slow  cooled  in  a  beaker  of  water  to  around  30°C.  One  pi  of  1.0  M  dithiothreitol 
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(DTT),  2  fil  labelling  mix  (diluted  1/5)  and  0.5  ^l  a[^^S]dATP  in  tricene  buffer,  >1000  Ci/ 
mmol  (10  |iCi/|il  -  Amersham,  Oakville,  ON)  were  added  to  the  primer/template  mixture. 
Aliquots  of  the  primer/template  mixture  were  added  (3.5  pi)  to  each  of  four  tubes  containing 
2.5  pi  of  the  appropriate  dideoxy-terminator  mix  (ddGTP,  ddATP,  ddTTP  or  ddCTP)  which 
had  been  previously  prewarmed  at  37°C.  The  reactions  were  incubated  3-5  min  at  37°C  after 
which  4  pi  of  dye-stop  solution  was  added  to  each  tube.  Reaction  tubes  were  heated  2  min  at 
80°C  immediately  prior  to  loading  on  the  gel.  After  the  wells  were  flushed,  2-3  pi  of  each 
reaction  mix  was  loaded  in  sets  of  four  dideoxy-terminators  (GATC)  using  flat  Gel-Loader 
pipette  tips  (Gordon  Technologies,  Mississauga,  ON;  National  Scientific,  Claremont,  CA)  and 
a  Gilson  Pipetteman™  P-20  pipettor  (Mandel  Scientific,  Guelph,  ON).  After  a  2-4  hr  run  time, 
a  second  set  of  the  samples  could  be  loaded  into  any  remaining  wells.  Forty  well  sharktooth 
combs  were  used  for  40  cm  gels.  A  number  of  different  gel  formats  were  run,  but  generally 
consisted  of  8%  acrylamide  (1:20  bis-acrylamide:acrylamide  ratio),  8  M  urea  and  lx  TBE 
buffer.  Forty  cm  and  90  cm,  0.4  mm  thick  gels  using  Gibco/BRL  (Burlington,  ON)  SO  and  S2 
sequencing  units,  respectively,  were  run  at  40-50  W  constant  power  on  BioRad  (Mississauga, 
ON)  3000  power  supplies.  When  the  bromphenol  blue  dye  was  near  the  bottom,  the  gel  was 
removed,  fixed  with  three  changes  of  10%(v/v)  methanol/ 10%(v/v)  acetic  acid  (5  min  per 
change).  The  gel  was  rinsed  with  several  changes  of  dH20,  followed  by  transfer  to  a  sheet  of 
Whatman  2  or  3  MM  filter  paper.  Ninety  cm  gels  were  cut  into  two  equal  pieces  prior  to  transfer 
to  the  Whatman  filter  paper.  The  gel  was  covered  with  plastic  wrap,  and  dried  on  a  gel  drier 
under  vacuum  for  30  min  at  80°C.  The  plastic  wrap  was  removed,  and  the  dried  gel  and  was 
exposed  to  Kodak  (Rochester,  NY)  XAR-5  X-ray  film  (14"  x  17")  with  a  Dupont  Cronex™ 
intensifying  screen  (Mississauga,  ON).  The  X-ray  was  developed  in  Kodak  (Rochester,  NY) 
GBX  developer  and  fixer  and  analysed  on  a  Speed  Reader™  semi-automated  DNA  sequence 
imager  (Intelligenetics,  Mountain  View,  CA).  The  edited  sequence  files  were  transferred  to 
text  format  using  PC/Gene™  (Intelligenetics,  Mountain  View,  CA)  DNA  analysis  software, 
prior  to  transfer  to  the  Lasergene™  biocomputing  software  (DNASTAR  Inc.,  Madison,  WI) 
installed  on  a  Power  Macintosh™  8100/80  (Apple  Computer,  Cupertino,  CA)  for  sequence 
assembly. 


UNCLASSIFIED 


DRES-SR-633 


UNCLASSIFIED 


13 


Automated  sequencing  was  performed  using  an  Applied  Biosystem  Instruments  (ABI)  373 
automated  fluorescent  sequencer  and  the  ABI  Prism™  Dye  Terminator  Cycle  Sequencing  kit 
(Perkin  Elmer,  Foster  City,  CA).  Both  Amplitaq™  and  Amplitaq  FS™  were  used  on  Qiagen 
purified  dsDNA  plasmid  templates  according  to  the  manufacturer’s  instructions.  The  reactions 
were  run  on  a  Perkin  Elmer  thermocycler  (Foster  City,  CA)  using  30  cycles  of:  96°C  for  30  sec; 
45°C  for  30  sec  and  60°C  for  4  min.  Reactions  were  purified  on  Centri-Sep™  columns 
(Princeton  Separations,  Adelphia,  NJ),  dried  and  run  on  an  ABI  373  automated  sequencer 
(USAMRIID,  Regional  DNA  Synthesis  Laboratory).  Chromatographic  data  obtained  was 
downloaded  onto  the  Macintosh  8100/80  and  edited  in  SeqEd™  ver  1.0.3  (ABI,  Foster  City, 
CA),  or  transferred  directly  to  Lasergene™  (DNAStar,  Madison,  WI)  for  sequence  assembly. 
Thermosequense™  reactions  were  also  compared  to  the  Amplitaq  FS™  according  to  the 
manufacturer’s  protocol  (Amersham,  Oakville,  ON),  however,  the  results  were  no  better  than 
the  Amplitaq  FS™  (Perkin  Elmer,  Foster  City,  CA). 

CONSTRUCTION  OF  THE  FULL-LENGTH  26S  REGION  OF  WEE 


A  clone,  pcDW-12,  containing  a  3100  bp  fi-agment  of  the  26S  region  of  WEE  strain  71V-1658 
was  identified  from  the  WEE  library  by  hybridization  to  WEE  probes,  restriction  mapping  and 
DNA  sequencing.  The  5'  end  of  the  26S  region  was  synthesized  from  the  WEE  7 1 V- 1 65  8  RNA 
using  standard  PCR  reaction  conditions,  and  the  WEES'Sst  and  WEEP3  primers  as  outlined  in 
Fig.  1 .  Before  the  PCR  reaction  was  run,  a  reverse  transcription  (RT)  of  the  RNA  template  was 
completed  in  the  same  tube  using  the  PCR  reaction  conditions  by  programming  in  a  42°C 
“soak”  cycle  before  the  start  of  the  thermocycling  reaction.  The  changes  incorporated  were  as 
follows:  the  primers  were  allowed  to  slow  anneal  to  the  RNA  template,  followed  by  the 
addition  of  18  U  of  AMV  reverse  transcriptase  (Life  Sciences,  Petersburg,  FL)  and  2.5  U 
Amplitaq™  polymerase  (Perkin  Elmer,  Foster  City,  CA).  The  reaction  was  incubated  for  30- 
60  min  at  42°C,  and  then  the  standard  PCR  cycling  was  programmed  to  run.  The  PCR  products 
were  then  analyzed  on  agarose  gels.  An  1800  bp  fragment  of  the  estimated  size  (Hahn  et  al, 
1988)  was  isolated  as  previously  described.  The  firagment  was  then  restriction  digested  with 
Ncol  and  SstI,  gel  purified,  and  ligated  into  the  Sstl/Ncol  sites  of  plasmid,  phT3T7BM+ 
(Boehringer  Mannheim,  Laval,  PQ).  To  expand  on  the  details,  the  Sstl/Ncol  digested 
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phT3T7BM+  (gel  purified,  50-100  ng)  was  added  to  the  Sstl/Ncol  1800  bp  PCR  fragment 
generated  from  WEE5'SstAVEEP3  primers  (1:3,  1:1  and  3 : 1  molar  ratios)  in  a  total  volume  of 
15  pi  dH20.  Controls  with  digested  plasmid  DNA  only  and  insert  DNA  only  were  also 
included.  The  DNA  was  slow  annealed  from  65°C,  followed  by  the  addition  of  4  pi  of  5x  ligase 
buffer  (Gibco/BRL,  Burlington,  ON)  (250  mM  Tris-HCl,  pH  7.6, 50  mM  MgCl2, 25%w/v  PEG 
8000,  5  mM  ATP,  5  mM  DTT  )  and  1  U  T4  DNA  ligase  (Gibco/BRL,  Burlington,  ON; 
Pharmacia,  Baie  d’Urfe,  PQ).  The  reaction  was  incubated  for  4  hr  at  15°C  in  a  refrigerated 
waterbath.  Fifty  pi  of  TE  buffer  was  added  and  2-5  pi  of  diluted  ligation  mixture  was  used  to 
transform  competent  E.  coli  cells. 

The  transformation  protocol  of  Hanahan  (1983)  was  followed.  Competent  DH5a  (maximum 
efficiency)  E.  coli  cells  were  used,  and  purchased  from  Gibco/BRL  (Burlington,  ON).  Briefly, 
2-5  pi  of  diluted  ligation  reaction  was  added  to  100  pi  of  cells  in  pre-chilled  15  ml  snap  cap 
tubes.  The  tubes  were  left  for  30  on  ice,  heat  shocked  for  45  sec  at  A2°C  and  then  cooled  2-3 
min  on  ice.  One  ml  of  2x  YT  [1  %(w/v)  tryptone,  1  %(w/v)  yeast  extract,  0.5  %(w/v)  NaCl] 
or  SOB  [2  %(w/v)  tryptone,  0.5  %(w/v)  yeast  extract,  8.2  mM  NaCl,  2.5  mM  KCl,  10  mM 
MgCl2,  10  mM  MgS04]  broth  was  added  and  the  tubes  incubated  at  37°C  for  30  min  with 
gentle  agitation.  The  cell  suspension  (100  pi)  was  plated  out  onto  a  LB  plate  containing  50- 
100  pg/ml  ampicillin. 

A  recombinant  plasmid  containing  the  1800  bp  WEE  5'  end  (phTW5'-3)  was  identified  by 
restriction  analysis  of  rapid  plasmid  preparations  and  confirmed  by  fine  restriction  mapping 
and  DNA  sequence  analysis.  The  Xbal/Ncol  fragment  of  phTW5'-3  (WEE  5'  1800  bp 
recombinant  in  phT3T7BM+)  was  ligated  into  the  Xbal/Ncol  sites  of  pcDW-12.  After 
transformation  into  DH5a,  a  full-length  recombinant  containing  the  complete  WEE  71 V-1658 
26S  region  was  identified  (XH-7)  for  further  analysis.  Additional  cDNA  constructs  to  WEE 
strain  71V-1658  were  made  using  primers  WEE5'Sst  and  WEE3'HindIII  (see  Table  1)  for  the 
RT-PCR  amplification  and  directional  cloning  of  the  full-length  WEE  26S  into  Invitrogen 
vector  pRSETA  (cloned  into  the  SstI  and  Hindlll  sites). 
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PCR  AMPLIFICATION  OF  THE  NONSTRUCTURAL  REGION  OF  WEE 

A  PCR  cloning  strategy  was  employed  to  produce  a  set  of  three  overlapping  fragments 
(WNST-A,  B  and  C)  for  the  nonstructural  region  of  WEE71V-1658  (Fig.  2).  RT-PCR 
reactions  were  carried  out  using  three  sets  of  PCR  primers  as  listed  in  Fig.  2.  For  WNS-A 
fragment,  two  different  combinations  of  primers  were  made,  using  a  SINS'  or  EHJ5'  upstream 
primer,  respectively.  RT-PCR  amplified  cDNA  was  gel  purified,  and  ligated  into  the  modified 
EcoRI  site  of  the  pCRII  vector  supplied  with  the  TA-cloning  system™  (Invitrogen,  San  Diego, 
CA).  The  recombinant  plasmid  was  transformed  into  E.  coli  InvoF’  competent  cells  as 
described  (Invitrogen,  San  Diego,  CA).  Isolates  were  screened  by  restriction  mapping  rapid 
plasmid  preparations  and  by  DNA  sequencing  using  the  T7  and  Ml 3  reverse  primers 
(Invitrogen,  San  Diego,  CA). 

RESULTS 

HJ  AND  WEE  LIBRARY  SCREENING 

Plaque  assays  of  the  batches  of  WEE  71V-1658  grown  at  USAMRIID  were  completed  (data 
not  shown).  The  titre  of  the  10%  SMB  of  WEE  71V-1658  was  6.1  X  10^  pfu/ml.  The  seed 
stock  was  4.1  X  10^  pfu/ml.  The  quality  of  cDNA  libraries  is  dependent  foremost,  on  the 
integrity  of  the  RNA  starting  material.  RNA  gels  of  HJ  virus  strain  B320  and  the  initial  batch 
of  WEE  7 IV- 165 8  (CDC)  indicated  some  degradation  in  the  samples  had  occurred  (data  not 
shown).  This  is  reflected  in  the  large  numbers  of  small  inserts  (<1000  bp)  encountered  when 
screening  isolates  of  HJ  library  for  insert  size  (see  Fig.  3  and  Table  2).  This  was  surprising  since 
Invitrogen  indicated  that  the  cDNA  synthesis  had  attained  an  excellent  size  distribution, 
ranging  from  100  bp  to  >8,000  bp.  In  addition,  the  cDNA  was  size  selected  (>2,000  bp)  prior 
to  ligation  and  transformation  into  the  pcDNA  II  vector.  In  any  event,  we  used  oligonucleotide 
probes  to  the  HJ  3'  end  (Ou  et  u/,1981)  and  the  promoter  region  (WEEPRO  primer)  of  WEE 
(see  Table  1).  As  genomic  sequence  information  for  HJ  was  limited  to  the  3'  and  5'  ends,  we 
had  speculated  diat  if  HJ  and  WEE  were  related,  the  26S  promoter  area  could  show 
conservation  of  the  genome  sequence  and  cross-reactivity  to  HJ.  A  new,  nonradioactive 
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procedure  (E-link)  was  used  to  label  HJ3'  and  WEEPRO  oligonucleotide  probes  to  screen 
isolates  of  interest  (26S  region).  However,  better  hybridization  results  were  acheived  using 
probes  conventionally  labelled  at  the  5'  end  of  the  oligonucleotide  with  y-[^^P]ATP  and 
poljoiucleotide  kinase.  Nevertheless,  over  100  isolates  were  screened  by  restriction  mapping 
(see  Fig.  3)  and  oligonucleotide  probe  reactivity  (see  Table  2).  The  size  distribution  of  the 
inserts  was  as  follows:  0-500  bp  -  32%;  500-1000  bp  -35%;  1000-2000  bp  -  26%;  and  >2000 
bp  -  7%.  Non-reactivity  with  the  probes  would  only  indicate  that  the  isolate  did  not  carry  an 
insert  that  extended  into  the  limited  target  region  of  the  probe,  and  could  not  be  interpreted  as 
the  isolate  carrying  no  HJ  DNA.  A  number  of  HJ  clones  were  identified  for  DNA  sequence 
analysis  including:  A19,  A24,  A35,  B23,  C12,  D1  and  D2).  Further  work  was  not  completed 
with  Highlands  J  isolates,  as  the  emphasis  shifted  to  the  WEE  strain  7 IV- 1658  library. 

As  previously  mentioned,  the  initial  batch  of  WEE  71V-1658  showed  degradation  by  RNA 
formaldehyde  gel  electrophoresis  (data  not  shown),  however,  subsequent  batches  of  genomic 
RNA  of  WEE  71V-1658  were  very  good  (aliquots  stored  in  ethanol  at  -70°C).  Sequence  data 
was  available  to  the  entire  26S  region  of  WEE  strain  BFS  1703  (Hahn  et  al,  1988)  and  was 
used  to  design  a  set  of  primers  that  could  be  used  for  PCR  amplification,  as  well  as  DNA 
sequence  analysis  of  WEE  strain  71V-1658  (see  Table  1).  The  fragments  generated  by  PCR 
were  designed  to  overlap  at  tmique  restriction  sites,  and  allowed  PCR  generated  fragments  to 
be  substituted  into  the  WEE  26S  clones.  This  was  used  to  our  advantage  in  the  subsequent 
genetic  engineering  of  a  full-length  26S  clone,  as  a  complete  clone  could  not  be  isolated  from 
the  Invitrogen  library.  Rapid  plasmid  isolates  were  screened  by  dot  blot  analysis  using 
WEEPROAVEEPO,  WEEN5AVEEP5,  WEEN7AVEEP7  and  WEENOAVEE3'  PCR  generated 
fragments  as  probes  (see  Fig.  4).  The  results  from  dot  hybridization  are  summarized  in  Table 

3.  Dot  blot  hybridization  was  used  to  quickly  size  inserts  in  the  WEE  library.  The  PCR  probes 
selected  were  spaced  along  the  26S  gene  segment.  We  theorized  that  a  full-length  26S  insert 
would  hybridize  with  all  foxir  PCR  probes,  while  smaller  the  inserts  would  only  hybridize  to 
one  or  two  probes.  No  isolates  were  identified  which  hybridized  to  all  four  probes,  but  a 
number  of  the  largest  isolates  hybridized  to  three  of  the  four  probes.  These  included  pcDW-1, 

4,  6,  7  and  12  (data  not  shown).  The  size  distribution  of  inserts  in  the  pcDNA  II  vector  was 
similar  to  that  obtained  with  the  HJ  library,  with  not  many  large  inserts  obtained  (>2,000  bp). 
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The  largest  clones  identified  were  pcDW-6  and  pcDW-12,  which  were  estimated  to  be  2900 
and  3100  bp,  respectively,  by  restriction  analysis  on  agarose  gels  (see  Table  3).  This  was  still 
25-30  %  shorter  then  the  estimated  4100-4200  size  of  the  26S  region  of  WEE.  DNA  sequence 
analysis  confirmed  that  both  clones  pcDW-6  and  pcDW-12  were  truncated  from  the  5'  end  of 
the  WEE  26S  region  at  nucleotide  (nt)  1798  and  nt  1446  (BPS  1703  numbering  system  -  Hahn 
et  al,  1988),  respectively.  However,  since  the  WEE  sequencing  primers  were  designed  to 
produce  fragments  which  terminated  at  unique  restriction  sites  (BPS  1703  WEE  26S  sequence 
-  Hahn  et  al,  1988),  a  strategy  was  developed  to  place  a  PCR  synthesized  5'  fragment  onto  the 
end  of  pcDW-12  as  outlined  in  Pig.  1. 

CONSTRUCTION  OP  THE  PULL-LENGTH  26S  REGION  OP  WEE 

RT-PCR  was  used  to  synthesize  the  WEE  26S  5'  end  using  the  WEE5'Sst  and  the  WEEP3 
primers.  A  fragment  of  the  expected  size  (1800  bp)  was  isolated  from  agarose  gels  and  ligated 
into  the  Sstl  and  Ncol  sites  of  phT3T7BM+  (data  not  shown).  A  number  of  recombinant 
plasmids  containing  the  1800  bp  WEE  5'  end  were  screened  by  restriction  analysis  of  rapid 
plasmid  preparations,  and  confirmed  by  fine  restriction  mapping  and  DNA  sequence  analysis 
(phTW5'-3, 4  and  6  -  data  not  shown)  Cloning  into  phT3T7BM+  allowed  us  to  excise  the  same 
1800  bp  fragment  with  the  addition  of  a  Xbal  site  at  the  5'  end.  The  Xbal/Ncol  fragment  of 
phTW5'-3  was  ligated  into  the  Xbal/Ncol  sites  of  pcDW-12  and  full-length  26S  region  clones 
(XH-7,  12)  were  screened  for  by  restriction  analysis  of  rapid  plasmid  preparations  (Pig.  5). 
XH-7  was  fine  mapped  (data  not  shown)  and  demonstrated  an  almost  identical  restriction 
fragment  pattern  to  WEE  strain  BPS  1703  (Hahn  et  al,  1988). 

DNA  SEQUENCE  ANALYSIS  OP  THE  26S  REGION  OP  WEE 

XH-7  was  completely  DNA  sequenced  using  a  combination  of  manual  (see  Pig.  6)  and 
automated  sequencing  (see  Pig.  7).  The  manual  sequencing  was  very  time  consuming  when 
compared  with  the  ABI  (Poster  City,  CA)  automated  sequencer.  Manual  sequences  were 
analysed  on  the  Speed  Reader™  (Intelligenetics,  Mountain  View,  CA).  While  being  an 
improvement  over  the  visual  reading  of  autoradiographs  of  DNA  sequencing  gels,  it  is  still  a 
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slow  and  tedious  process  as  compared  to  automated  fluorescent  systems.  A  complete  DNA 
sequence  was  obtained  for  clone  XH-7  (see  Appendix  A)  using  the  primers  listed  in  Table  1. 
In  the  nucleotide  sequence,  63  nt  changes  were  detected  ,  with  only  1 1  causing  amino  acid 
substitutions  from  WEE  BPS  1703  (see  Fig.  8).  The  63  nt  changes  were  generally  scattered 
throughout  the  26S  gene  segment  as  follows:  5'  noncoding  region  - 1;  NC  -  16;  E3  -  2;  E2  -15; 
6K  -  1;  El  -  19,  3'  noncoding  region  -  9.  The  26S  promoter  (101-124)  and  the  tandem  repeats 
(3896-3935,  3954-3993)  in  the  3'  noncoding  region  were  conserved  (1  nt  change  to  make  it  an 
exact  repeat  in71V-1658  -  Appendix  A).  The  XH-7  construct  had  a  continous  open  reading 
frame  beginning  and  terminating  at  the  same  amino  acid  residues  as  strain  BPS  1703 
(Appendix  B).  The  amino  acid  sequence  was  highly  conserved,  with  10  of  the  1 1  amino  acid 
differences  being  conservative  changes.  Only  the  Cys  to  Arg  (28)  in  the  NC  protein  would  be 
considered  nonconserved  (see  Fig.  8).  The  XH-7  clone  was  truncated  at  the  3'  end  of  the 
genome  with  respect  to  the  BPS  1703  sequence.  This  is  likely  a  deletion,  as  restriction  analysis 
of  the  pcDW-12  as  compared  with  pcDW-6  and  7,  showed  a  smaller  band  mapping  to  the  3' 
end  (1600  bp  instead  of  1700  bp  for  pcDW-6  and  7).  In  addition,  the  poly  A  tract  at  the  3'  end 
was  completely  missing.  To  confirm  this,  the  3'  ends  of  both  pcDW-6  and  pcDW-7  could  be 
sequenced.  Further  full-length  cDNA  constructs  to  WEE  strain  7 IV- 1658  were  made  using 
primers  WEE5'Sst  and  WEE3'Hindin  in  a  RT-PCR  reaction.  A  set  of  XTRAH  clones  were 
isolated,  and  XTRAH-5  was  characterized  by  restriction  mapping  (data  not  shown)  as 
containing  the  entire  26S  coding  sequence. 

ANALYSIS  OF  THE  5'  WEE  NONSTRUCTURAL  REGION 

One  of  the  approaches  initially  examined  for  vaccine  research  was  to  obtain  an  infectious  clone 
for  mutagenesis  studies.  In  order  to  accomplish  this  task,  the  entire  genome  of  WEE  must  be 
cloned.  Sequence  information  was  available  to  the  middle  of  the  nonstructural  region  (W eaver 
et  al,  1993),  however,  no  sequence  information  was  available  for  the  5'  end  of  the  nonstructural 
region.  Nevertheless,  it  was  deduced  that  the  5'  termini  from  a  number  of  alphaviruses  were 
highly  conserved  (Ou  et  a/,1983),  and  this  information  was  used  to  devise  a  cloning  strategy 
for  the  region  (see  Fig.  2).  Two  primers  were  made  to  the  5'  termini,  one  based  on  the  SIN 
sequence  (SIN5')  and  the  other  based  on  a  combination  of  the  EEE  and  HI  sequences  (EHJ5'). 
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RT-PCR  reactions  were  carried  out  using  three  sets  of  PCR  primers  as  indicated  in  Fig.  2,  to 
generate  3  overlapping  fragments  for  reconstruction  of  the  WEE  nonstructural  region.  For 
amplification  of  the  5'  nonstructural  gene  fragment  (WNS-A),  the  EHJ5'  primer  worked  well 
(see  Fig.  9),  generating  a  band  of  the  expected  MW.  This  is  in  contrast  to  the  SINS'  primer 
which  generated  only  a  multitude  of  smaller  sized  bands,  of  which  many  were  eliminated  when 
Perfect  Match™  was  added  to  the  reaction  mix  (see  Fig.  9).  In  general,  Perfect  Match™ 
increased  the  stringency  of  the  PCR  reaction,  eliminating  some  extraneous  bands.  Following 
gel  purification  and  ligation  of  the  WNS-A,  WNS-B  and  WNS-C  fragments,  transformant 
colonies  were  identified  by  colony  hybridization,  and  rapid  plasmid  preparations  were 
restriction  analyzed  (data  not  shown).  Isolates  of  the  proper  size  were  identified  for  WNS-A 
(WNSTA-3,  4,  5  &  6)  and  WNS-C  (WNSTC-1,  2,  3  &  4)  fragments.  These  were  sequenced 
with  the  T7  and  Ml 3  reverse  primers  from  the  pCRII  multicloning  region. 

The  clustal  method  of  homology  comparison  was  used  for  both  DNA  and  protein  sequences  in 
the  Lasergene™  program.  The  WNSTA-3  5'  DNA  sequence  showed  high  homology  to  HJ 
(Ou  et  al,  1983),  EEE  North  American  (NA)  (Weaver  et  al,  1994)  and  EEE  South  American 
(SA)  (Ou  et  al,  1983)  as  illustrated  in  Fig.  10a  and  Appendix  C.  Homology  to  SIN  strain  HR 
was  more  distant  (data  not  shown),  as  illustrated  in  the  phylogenetic  tree  (see  Fig.  10b).  The 
First  19  nt  of  WNSTA-3  are  derived  from  the  EHJ5'  primer  and  were  not  included  in  the 
homology  calculations,  as  they  may  not  correspond  exactly  to  the  WEE  71 V-1658  5'  end  of  the 
genome.  If  the  corresponding,  translated  open  reading  frames  of  WNSTA-3,  HJ,  EEE-NA, 
EEE-SA  and  SIN  are  compared,  fairly  high  homology  is  obtained  for  WEE,  HJ,  EEE-NA  and 
EEE-SA,  as  shown  in  Fig.  11a.  Structures  which  were  highly  conserved  in  all  alphaviruses, 
such  as  a  51  nt  sequence  (ntl38-188)  and  a  22  amino  acid  sequence  (residues  34-55)  (Ou  etal, 
1983)  were  also  present  in  the  WEE  sequence.  The  3'  end  of  WNSTA-3  was  sequenced  for  233 
nt,  and  a  comparison  to  the  BFS-1703  sequence  (Weaver  et  al,  1994)  using  the  Clustal  method 
yielded  very  high  homology. 
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DISCUSSION 

When  vaccine  research  on  WEE  was  formulated  at  the  start  of  the  project,  Highlands  J  strain 
B320  was  included,  as  its  crossreactivity  and  reduced  virulence  in  humans  made  it  a  good 
vaccine  candidate.  Previous  studies  had  postulated  that  HJ  was  a  variant  of  WEE  based  on  its 
antigenic  cross-reactivity  (Hayes  and  Wallis,  1977;  Karabatsos  et  al,  1988)  and  it  was  referred 
to  as  the  eastern  strain  of  WEE  virus  in  the  early  literature.  A  cDNA  library  to  HJ  strain  B320 
was  produced  containing  a  number  of  isolates  whose  inserts  were  reactive  with  the  HJ  probe 
(see  Table  2).  However,  with  the  limited  sequence  information  available  and  the  priority  to 
produce  a  WEE  specific  subunit  vaccine,  the  research  focus  was  shifted  to  WEE.  Nevertheless, 
in  order  to  glean  further  information  on  the  pathogenesis  of  WEE,  it  would  be  of  interest  to 
define  how  HJ  virus  is  related,  through  DNA  sequence  analysis. 

The  WEE  library  was  disappointing  in  that  a  full-length  clone  of  the  26S  region  could  not  be 
obtained.  The  largest  clones  were  estimated  to  be  2900  and  3100  bp  respectively,  for  pcDW- 
6  and  pcDW-12.  The  longer  clone,  pcDW-12  was  chosen  to  serve  as  the  starting  material  for 
engineering  of  the  full-length  26S  region  (Fig.  1).  Utilizing  the  advances  in  PCR  cloning,  the 
5'  end  of  phTW5'-3  was  fused  at  the  Ncol  site  to  the  bulk  of  pcDW-12  to  form  the  XH-7  clone. 
A  set  of  monoclonal  antibodies  to  WEE  strain  B1 1  have  been  made  and  partially  characterized 
by  ELISA  and  western  blot  analysis  (impublished  results).  These  were  reactive  to  inactivated 
WEE  71 V-1658  in  western  blots,  and  will  be  valuable  in  defining  levels  of  protein  produced  in 
vitro  (tissue  culture,  baculovirus,  vaccinia)  or  in  vivo  (DNA  immunization).  The  restriction 
map  of  71  V-1658  was  highly  conserved  when  compared  to  BFS  1703  and  it  could  be  used  to 
restriction  map  isolates  and  directional  clone  (cloning  using  two  different  restriction  sites)  PCR 
fragments. 

One  technological  change  which  had  an  immense  impact  on  the  outcome  of  the  project  was  the 
switch  in  DNA  sequencing  from  a  manual  method  to  an  automated  fluorescent  method.  The 
amoimt  of  sequence  data  that  can  be  generated  by  the  fluorescent  sequencers  is  astounding.  A 
year’s  worth  of  work  could  be  easily  cut  down  to  one  or  two  months.  The  need  for  radioistope 
was  eliminated  and  samples  were  easier  to  set  up,  run  and  analyze.  The  raw  sequence  data 
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(ABI  chromatograms)  could  be  directly  imported  into  the  Power  Macintosh-based 
Lasergene™  software  for  evaluation. 

The  WEE  virus  strain  71V-1658  was  selected  by  Dr  Trent  as  a  vaccine  isolate,  as  it  was  a 
relatively  recent  isolate  of  WEE.  The  surprisingly  high  degree  of  homology  between  the  two 
strains  of  WEE  is  extended  to  the  nucleotide  sequence,  as  shown  by  homology  comparison  in 
DNASTAR  to  the  BPS  1703  sequence  (see  appendix  A).  The  majority  of  the  63  nt  changes 
were  silent  mutations.  Out  of  eleven  amino  acid  changes  in  the  polyprotein,  10  produced 
conservative  amino  acid  changes,  and  only  one  at  residue  28  (Cys  Arg)  could  be  considered 
radical.  The  viruses  were  isolated  around  the  same  time  period,  however,  this  high  degree  of 
conservation  was  also  noted  with  a  South  American  isolate,  WEE  strain  CBA  (personal 
commimication  -  M.  Parker,  USAMRIID).  In  addition,  the  N-terminal  regions  of  the  El  and 
E2  proteins  were  also  conserved  when  compared  to  the  McMillan  strain  of  WEE  which  was 
isolated  in  1957  (Bell  et  al,  1983).  From  over  60  residues  which  were  determined  for  the  N- 
termini  of  each  of  El  and  E2  proteins  of  the  McMillan  strain,  only  3  amino  acid  changes  were 
noted  for  71 V-1658  (see  Fig.  8).  With  these  indications  that  the  amino  acid  sequence  is  highly 
conserved  amongst  WEE  strains,  one  strain  of  WEE  should  be  able  to  provide  protection  to 
most  of  the  WEE  isolates.  While  the  3'  noncoding  region  of  XH-7  was  deleted  (21  nt),  it  should 
not  affect  the  expression  of  the  structural  genes,  especially  in  constructs  which  contain  a  poly- 
A  tailing  and/or  a  transcription  terminator  signal. 

The  5'  end  of  the  WEE  genome  was  surprisingly  well  conserved  when  compared  with  the  5' 
end  of  the  HJ  or  EEE  genomes  (see  Fig.  10a).  It  was  previously  hypothesized  that  WEE  was 
a  recombinant  virus,  derived  from  EEE  and  a  SIN  -like  virus,  with  the  nonstructural  region  and 
capsid  protein  derived  from  EEE  (Hahn  et  al,  1988).  This  data  confirms  that  the  5'  end  of  WEE 
is  derived  from  EEE  and  not  from  a  SIN  ancestor  (see  Fig.  lOb  and  I  Ib).  The  limited  HJ  virus 
sequence  was  also  highly  conserved  in  this  region  of  the  genome,  however,  more  sequence  data 
for  HJ  will  have  to  be  obtained  to  draw  definitive  conclusions.  The  2.2  kb  WNS-A  fragment 
represents  a  part  of  the  WEE  genome  that  has  not  been  examined  to  date.  In  the  course  of  the 
research,  an  infectious  clone  approach  had  been  dropped  in  favor  of  a  subunit  vaccine  approach 
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(DNA  immunization,  baculo virus  or  yeast  expression).  However,  it  would  be  of  interest  to 
complete  the  sequencing  of  the  WNS-A  fragment  from  a  scientific  point  of  view. 

CONCLUSION 

A  full-length  clone  for  the  26S  region  of  WEE  strain  71V-1658  has  been  obtained  and 
completely  sequenced.  Only  eleven  of  the  sixty-three  nucleotide  changes  resulted  in  amino 
acid  changes  when  the  sequence  was  compared  to  WEE  strain  BPS- 1703.  The  high  degree  of 
conservation  of  the  structural  proteins  was  maintained  when  compared  to  the  N-terminal 
regions  of  the  El  and  E2  envelope  proteins  of  theWEE  McMillan  strain.  The  conserved  nature 
of  the  structural  proteins  would  indicate  that  one  strain  of  WEE  should  be  able  to  cross-protect 
against  all  WEE  strains.  Furthermore,  the  5'  end  of  the  genome  was  cloned  as  a  2.2  kb  fragment 
and  sequenced  for  247  nt  on  5'  end  and  233  nt  on  3'  end  of  the  fragment  (WNS-A).  It 
demonstrated  high  homology  to  both  HJ  and  EEE,  adding  further  evidence  that  the  entire  5' 
nonstructural  region  of  WEE  was  derived  from  EEE.  This  data  also  demonstrates  the  close 
relationship  between  HJ  and  WEE  at  the  genetic  level,  which  was  previously  implied  by 
antigenic  cross-reactivity.  In  summary,  the  genetic  cloning  and  sequencing  of  the  WEE  26S 
region  marks  the  critical  first  step  in  the  generation  of  a  subunit  vaccine  to  WEE. 
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Table  1:  WEE  26S  REGION  PRIMERS 


✓ 

• 

Name 

Length  Sequence 

Note 

✓ 

HJ-3' 

30 

GCATTATGCA.CTGTGCTTCCTCGGAAGTAC 

Highlands  J  3 '  end 

WEEPRO 

30 

AATCACCCTCTACGGCTGACCTAAATAGGT 

WEE 

26S 

96-12S,  +  STRAND  PROMOTOR 

>/ 

WEEPR-SST 

24 

GGCTGAGCTCAATAGGTGACGTAG 

WEE 

26S 

109-132,  + 

STRAND  SstI  SITE 

✓ 

• 

WEE3  ' 

30 

GTAGTGTATATTAGAGACCCATAGTGAGTC 

WEE 

26S 

4086-4057, 

-  STRAND 

✓ 

WEES ' Sst 

20 

TCCAGATACGAGCTCATACT 

WEE 

26S 

1-20,  +  STRAND  SstI  SITE 

✓ 

WEENl 

20 

GGTGCCGCTGGAGGCCGTTT 

WEE 

26S 

238-257,  + 

STRAND 

✓ 

WEENIA 

20 

GATCTTAGGAGGTCGATAGC 

WEE 

26S 

285-304,  + 

STRAND 

✓ 

• 

WEEN2 

20 

GGCTGATGAAACCACTCCAC 

WEE 

26S 

547-566,  + 

STRAND 

✓ 

• 

WEEN3 

20 

CCACCCGTGTGCTATTCACT 

WEE 

26S 

990-1009,  + 

STRAND 

>/ 

• 

WEEN3A 

20 

CGCCGTGTTTCAGCCCAATA 

WEE 

26S 

990-1009,  + 

STRAND 

✓ 

• 

WEEN4 

20 

TCACGAGCGGAGCATCTGAG 

WEE 

26S 

1459-1478, 

+  STRAND 

✓ 

• 

WEENS 

20 

GGCATCACCCTCCACCTGAC 

WEE 

26S 

1959-1978, 

+  STRAND 

✓ 

• 

WEEN6 

20 

TTGTTATTCTGTTCCGCTGC 

WEE 

26S 

2469-2488, 

+  STRAIN 

• 

WEEN7 

20 

CTATTGATCATGCAGTCGCA 

WEE 

26S 

2893-2912, 

+  STRAND 

✓ 

• 

WEENS 

20 

AGTGGAGCCTCTGCGAGCGT 

WEE 

26S 

3335-3354, 

+  STRAND 

✓ 

• 

WEEN9 

20 

GAGGAGTGGGCGGGAAAGGC 

WEE 

26S 

763-782,  + 

STRAND 

✓ 

• 

WEENl 0 

20 

CTAAAACTCGATGTATTTCC 

WEE 

26S 

3952-3971, 

+  STRAND 

✓ 

• 

WEENl 1 

20 

ACGCGAACGAAGATGAACGG 

WEE 

26S 

1752-1771, 

+  STRAND 

✓ 

• 

WEEN12 

20 

ACTGTCATTGTGCTGTGTGG 

WEE 

26S 

2214-2233, 

+  STRAND 

✓ 

• 

WEENl 3 

20 

CACAGTCATTCCTTCACCAC 

WEE 

26S 

2705-2724, 

+  STRAND 

✓ 

• 

WEENl 4 

20 

CGTCATCAGAAAGGGGCTTG 

WEE 

26S 

3080-3099, 

+  STRAND 

✓ 

• 

WEENl S 

20 

CAAAGCTGACAGGGAGGGAC 

WEE 

26S 

3509-3528, 

+  STRAND 

✓ 

• 

WEENl  6 

20 

GGAAAGCTGGTAAAGTGCCA 

WEE 

26S 

1554-1573, 

+  STRAND 

✓ 

WEENO 

20 

GGAGAACCACATAAAGTCGA 

WEE 

26S 

3714-3733, 

+  STRAND 

✓ 

WNSPl 

25 

GGCTAACGTGGACAGGGACGTGATG 

WEE 

26S 

57-33, 

-  STRAND 

✓ 

• 

WEEPO 

20 

GGCTATCGACCTCCTAAGAT 

WEE 

26S 

305-286, 

-  STRAND 

✓ 

• 

WEEPOA 

20 

CTGTCGGTTCCCTGGTTTAG 

WEE 

26S 

458-439  - 

■  STRAND 

✓ 

• 

WEEPl 

20 

CTGGGGAACGTCGCCATACT 

WEE 

26S 

656-637,  - 

“  STRAND 

✓ 

• 

WEEP2 

20 

CGTTCTCCAGCAGCGTGTCG 

WEE 

26S 

1083-1064, 

-  STRAND 

✓ 

• 

WEEP2A 

20 

TATTGGGCTGAAACACGGCG 

WEE 

26S 

1202-1183, 

-  STRAND 

✓ 

WEEP3 

20 

CTTCAAGTGATCGTAAACGT 

WEE 

26S 

1592-1573, 

-  STRAND 

✓ 

• 

WEEP4 

20 

ACTCCAGCCCTTCTCGCCCC 

WEE 

26S 

2097-2078, 

-  STRAND 

✓ 

WEEPS 

20 

GTTCGACCAACGCCTTATAC 

WEE 

26S 

2610-2591, 

-  STRAND 

✓ 

• 

WEEP6 

20 

AAGGGTGAAAAAGCGGCTGA 

WEE 

26S 

3067-3048, 

-  STRAND 

✓ 

WEEP7 

20 

GGTGATTCTGATGATCTCAC 

WEE 

26S 

3430-3411,  -  STRAND 

✓ 

• 

WEEPS 

20 

TGGAAACTGCCGCCTGGAAT 

WEE 

26S 

3759-3740, 

-  STRAND 

✓ 

• 

WEEPIO 

20 

CCTTGATGTCATGGTCGTGG 

WEE 

26S 

1344-1325, 

-  STRAND 

✓ 

• 

WEEPll 

20 

TGCACTGAGTGGTCTGTGTG 

WEE 

26S 

1870-1851, 

-  STRAND 

✓ 

• 

WEEP12 

20 

ATGTTTCAGCGTTGGTTGGC 

WEE 

26S 

2391-2372, 

-  STRAND 

✓ 

WEEP13 

20 

GTGTTCTCACTGTCACAGAA 

WEE 

26S 

2851-2832, 

-  STRAND 

✓ 

• 

WEEP14 

20 

ATGTGTGGTCGCTTCCTTCA 

WEE 

26S 

3584-3565, 

-  STRAND 
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Table  2:  SUMARY  OF  HJ  ISOLATES 


HJ-.r.  (WLI;!»R0)  FROBKS  •  N-I .ink.  .^2I».HYBONn-N  SIZl-  -  SI/I-  OI-  INS1:RT  A1TI:R  1:XCISI0N  with  XHOI  VO  |’R0S1*I:CTS  -  aw.  A24.  A.^5.  02.V  C12.  Ol.  D2 

^21’  HYB()NI)-C.  RI;S1*ECT1V1-I.Y  AND  HINDIII  Ct(X)D  I’ROSlMiCTS  •  Al.^.  AI.^.  A22.  A.^0.  B I.L  B24.  C19. 

Nl)  -  NOT  I)I:TI:RMINI-:D  C2(I.  dm.  D22.  D29. 1X1?.  D4I.  D42 
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WEES’SstI 


Ncol 


P 


WEEPS 
Sstl/Ncol  Digest 


Sst 


Mjal 

issti 

“hc 


phTW5’-3 


Ncol 

=] 


ligate  into  phT3T7BM+ 


Xbal/Ncol  Digest 


Ncol 


1800  bp  Xbal/Ncol 
fragment  phTW5’-3+ 


Xbal  Ncol 


Hindlll 


pcDW12 


Xbal 

^Sstl 


1 

Ncol 

XH-7 


Hindlll 

H- 


Figure  1.  Construction  of  the  Full-length  WEE  26S  Region 

The  5'  end  of  the  26  S  region  of  was  constructed  using  RT-PCR  with  the  WEE5'SstAVEEP3 
primers  as  described.  An  1800  bp  fragment  was  cloned  into  Sstl/Ncol  sites  of  phT3T7BM+. 
The  Xbal/Ncol  fragment  (from  phTW5'-3)  was  cloned  into  the  Xbal/Ncol  sites  of  pcDW-12  to 
produce  the  full-length  WEE  26S  region  isolate,  XH-7. 
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a)  r 


0  (kb)  1 


SIN5’- 

EHJ5’- 


-WNSP3 


WNSN2- 


-WNSP2 


WNSN3 


WNS-A 


-WNSPl 


WNS-B 


WNS-C 


SINS' 

(1-19) 

EHJ5' 

(1-19) 

WNSN2 

(811-830) 

WNSN3 

(3626-3645) 

WNSP3 

(57-33) 

WNSP2 

(3865-3846) 

WNSPl 

(1009-990) 

ATTGGCGGCG  TAGTACACA 
ATAGGG[C/T][A/G]TGGT[A/G]TAGAG[G/T] 
AGCAGCACCT  CACAAAGTCC 
CGACTATGAC  AGGCGGTTTG 
GGCTAACGTG  GACAGGGACG  TGATG 
GGAGCAGTAT  TGAAGTCTTG 
TAGAAGCACT  GAATCAACAG 


Figure  2.  PCR  Amplification  of  WEE  Nonstructural  Region 


a)  The  RT-PCR  cloning  strategy  is  shown  for  WNS-A,  B  and  C,  together  which  encompass  the 
entire  nonstructural  region  of  WEE.  Primers  used  are  indicated,  b)  The  actual  sequence  of  each 
of  the  primers  is  indicated,  with  the  location  in  brackets:  SIN5'  and  EHJ5'  from  Ou  et  al,  1983; 
WNSN2,  WNSN3,  WNSPl  and  WNSP2  based  on  Weaver  et  al,  1994;  and  WNSP3  from  Hahn, 
et  al,  1988. 
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Figure  3.  Restriction  Analysis  of  HJ  Isolates 

Rapid  plasmid  preparations  of  HJ  isolates  were  made  up  as  described  with  the  exception  that 
a  separate  RNase  A  reaction  was  carried  out  before  the  final  phenol/chloroform  extraction  and 
ethanol  precipitation  was  completed.  The  washed  and  dried  pellet  of  DNA  was  dissolved  in 
120  lof  dH20.  The  restriction  digest  mixture  consisted  of  18  IDNA,  2  11  Ox  reaction  buffer 
(500  mM  Tris-HCl,  pH8.0,  100  mM  MgCl2,  500  mM  NaCl)  and  2  lof  Hindlll/Xhol  (2U  of 
each)  stock  diluted  in  lx  reaction  buffer.  The  tubes  were  incubated  3  hr  at  37°C.  The  samples 
were  run  on  a  1  %(w/v)  TBE  agarose  gel.  Lanes:  1-12)  HJD-1  to  12;  13)  A,  DNA/Hindlll 
digested  MW  marker;  14)  OXIVd  RF  DNA/Haelll  digested  MW  marker. 
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Figure  4.  Dot  Blot  Hybridization  of  WEE  Isolates 

Rapid  plasmid  preparations  of  isolates  from  the  WEE  cDNA  library  (Invitrogen,  San  Diego, 
CA)  were  made  up  as  described.  Two  lof  each  sample  were  spotted  to  a  nylon  membrane 
(Hybond-N™  -  Amersham,  Oakville,  Ont.).  Samples  were  air  dried,  UV  fixed  and  hybridized 
with  a  Random  primer  [^^P]-labelled  probe  of  the  WEEN0AVEE3'  PCR  fragment  as  described. 
Samples:  Al-12)  pcDW  1-12;  Bl-12)  pcDW  13-24;  Cl-12)  pcDW  25-36;  Dl-12)  pcDW37- 
48;  El-12)  pcDW  49-60;  Fl-12)  pcDW  61-72;  Gl-12)  pcDW  73-84;  Hl-12)  pcDW  85-96;  II- 
12)  pcDW  97-108;  Jl-12)  pcDW  109-120;  Kl-8)  pcDW  121-128;  K9)  negative  control  (k 
DNA/Hindlll  digested);  KIO)  positive  control  (WEE  71V-1658  cDNA). 
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Figure  5.  Restriction  Mapping  of  WEE  Isolates 

Large-scale  plasmid  preparations  were  made  to  XH-7  and  XH-12.  Samples  were  digested  with 
restriction  enzymes  as  described  and  run  on  a  1  %(w/v)  TBE  agarose  gel.  Lanes:  1)  X,  DNA/ 
Hindlll  digested  MW  marker;  2)  0X174  RF  DNA/Haelll  digested  MW  marker.  XH-7  digested 
with:  4)  Xbal/Hindlir,  5)  Xbal/Xhol-,  6)  Xbal/Ncol,  7)  Bglll/Ncol,  8)  Sstl.  XH-12  digested 
with:  9)  Xbal/Hindlll;  10)  Xbal/Xhol;  11)  Xbal/Ncol;  12)  Bglll/Ncol;  13)  Sstl. 
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a) 


AGGAAGTTTG  TCGGTAGAGA 
GGAGTACTTG  TTCCCACCCG 
TCCATGGAAA  GCTGGTAAAG 
TGCCACGTTT  ACGATCACTT 
GAAGGAGACG  TCTGCCGGGT 
ACATAACCAT  GCACAGGCA 
G..?’ 


Figure  6.  Manual  DNA  Sequence  Analysis 

Manual  DNA  sequencing  was  run  as  described,  on  a  8%(w/v)  TBE  gel  (40  cm).  The  sequence  is 
read  sequentially  from  the  bottom  of  the  gel,  using  a  set  of  four  lanes.  The  sequence  shown  in 
a)  used  the  WEEN4  primer  to  sequence  WEE71V-1658  construct,  XH-7;  b)  autoradiograph  of  the 
sequencing  gel  with  the  deduced  sequence  (5'-AGGAAGTTTGTCGGTAGAGAGAGGAG. . .) 
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Figure  7.  Fluorescent  DNA  Sequence  Analysis 


Automated  sequencing  was  performed  using  a  Applied  Biosystem  Instruments  (ABI)  373 
automated  fluorescent  sequencer,  and  the  ABI  Prism™  Dye  Terminator  Cycle  Sequencing  kit 
(Perkin  Elmer,  Foster  City,  CA)  as  described  in  Material  and  Methods.  A  chromatogram  of  DNA 
sequence  data  generated  from  WNS-TA3  using  the  Ml  3  reverse  primer  is  shown  on  the  following 
page. 
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Position^ 

NT  Change^ 

Amino  Acid  Change^ 

Protein 

240 

TgC 

CysgArg(28) 

NC 

304 

CgT 

Ala  g  Val  (49) 

NC 

424 

AgG 

Lys  g  Arg  (89) 

NC 

907 

AgG 

Lys  gArg  (250) 

NC 

1011 

GgA 

Ala  g  Thr  (285) 

E3 

1182 

GgA 

Ala  g  Thr  (342) 

(Ala  in  McMillan  strain)'^ 

E2 

1358 

TgG 

Asp  g  Glu  (400) 

E2 

1984 

CgT 

Thr  g  Met  (609) 

E2 

2055 

AgT 

Thr  g  Ser  (633) 

E2 

2698 

GgA 

Arg  g  Lys  (847) 

(Lys  in  McMillan  strain)^ 

El 

3669 

AgT 

Thr  g  Ser  (1171) 

El 

Residue  361  (a.a.)  And  849  are  Thr  in  the  McMillan  strain,  but  a  Ser  (361)  and  His 
(849),  respectively,  in  71V-1658  &  BFS  1703^. 

^  Based  on  the  26S  sequence  of  WEE  strain  BFS  1703  (Hahn  et  al,  1988). 

^  Nucleotide  change  (strain  BFS  1703  g  71V-1658)  at  the  position  indicated. 

^  Amino  acid  change  (BFS  1703  g  71V-1658)  at  a.a.  residue  of  26S  polyprotein  ( ). 

^  Comparison  with  protein  sequence  generated  to  the  N-terminals  of  the  NC,  E2  &  El 
proteins  of  the  McMillan  strain  (Bell  et  al,  1983) 


Figure  8.  Structural  Protein  Homology  Between  WEE  Strains 

The  deduced  amino  acid  sequence  of  the  structural  proteins  of  WEE  strain  7 IV- 1658  was 
compared  to  the  deduced  amino  acid  sequence  of  BFS  1703  using  the  Clustal  method  of 
comparison  (Lasergene  sequence  homology  programs  -  DNA  Star,  Madison,  WI).  Silent 
mutations  were  not  included. 
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Figure  9.  WEE  5'  Nonstructural  Region 

RT-PCR  reaction  was  run  using  WEE  7 IV- 1658  RNA  as  template,  and  the  following  sets  of 
primers  with  and  without  Perfect  Match™  (Strategene,  La  Jolla,  CA).  a)  RT-PCR  reactions 
were  run  on  a  1%  preparatory  low  melting  agarose  gel.  Lane:  1)  X  Hindlll  digest;  2)  SIN57 
WNSP3;  3)  SIN5'AVNSP3  with  Perfect  Match™;  4)  EHJ5'AVNSP3;  5)  EHJ5’AVNSP3  with 
Perfect  Match™. 
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121  T  T  T  G  A  G  A  Tla  G  A  A  G  C  A  a|  GGCAGGTCACTGACAATGACCATG  TOSrA3-M13F!V 

1  I - - Ig  C  a  G  G  T  C  a  c1a1g0C  AATGACCATG  HJVOltrunc 

119  TTTGAGATAGAAGCAACGCAGGTCACTGACAATGACCATG  EEENA5  ‘  (WEAV) 
118  TTTGAGATAGAAG  00  A  CGCAGGTCA  00  G  AOAATGAOOATG  BEESADl 


oLcl  AATGOOAGAGOGTTTTOGOATGTGG  OL^  AOAAAGOTOAT  WNSIA3-M13RV 
0  T  A  A  T  G  00  AGAGOGTTTTOGO  AT  G  TGGOTAOAAAGOTOAT  HJVOltnmc 
OTAATGOTA  g[g]  G  OGTTTTOGO  aIc  0  T^G  OTA  o[t]a  A  G  0  T  0  A  T  EEE3SIA5'  (WEAV) 
OTAATGOTAGAGOGTTTTOGO  A0c|  TGGOTAOAA  aB  0  T  0  A  T  EEESAOl 


201  TGAGAGOGAAG  TO  GAOOGGGAOO  aB  G  T  T  A  T  0|t|T  G  G  A  0  A  T  T  WSIA3^M13RV 


64  0GAGAGOGAAG  T0  G  A  0  0  G  G  G  A 
199  T  G  A  G0  gS  G  a  a  G  T  G  G  aIt  A  0  aI  G  A 
198  0  G  aS  AGOGAAGTGG 


T  T  A  Tb_ 


■d  A  0  i - 1  HJVOltrunc 


G  A  0  0  A  G  G  T0  A  T  C^T  G~G  A0A  T  T  EEH>A5 '  (WEAV) 

EEESAOl 


EEENA5'(WEAV) 

EEESAOl 

WNSTA3-M13RV 

HJVOltrunc 

SINCG5' 


12  10  8  6  4  2  0 


Figure  10.  Nucleotide  Homology  In  the  WEE  5*  Nonstructural  Region 

a)  Sequence  comparison  of  5'  end  WEE  71 V-1658  (WSNTA-3)  with  the  HJ,  EEE  SA  (Ou  et 
al,  1983)  and  the  EEE  NA  (Weaver  et  al,  1994)  DNA  sequences  from  nt  120-240,  using  the 
Clustal  method;  b)  Phylogenetic  tree  of  the  DNA  sequence  comparison  of  5'  end  of  WEE,  HJ, 
EEE  SA,  EEE  NA  and  SIN  HR  (Strauss  et  al,  1984). 
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Figure  11.  Homology  of  the  N-terminus  of  Nonstructural  Protein  1 

a)  Comparison  of  the  translated  protein  sequence  of  WNSTA-3  sequence  (N-terminal  of  nspl) 
with  HJ,  EEE  SA  (Ou  et  al,  1983)  and  the  EEE  NA  (Weaver  et  al,  1994)  sequences  using  the 
Clustal  method;  b)  Phylogenetic  tree  of  the  sequence  comparison  of  the  N-terminus  of  nspl  of 
WEE,  HJ,  EEE  SA,  EEE  NA  and  SIN  HR  (Strauss  et  al,  1984). 
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Appendix  A  Alignment  of  the  26S  region  of  WEE  strain  7 IV- 1658  with  BPS- 1703  (Hahn, 
et  al,  1988).  The  WEE  sequence  begins  at  nt  14  and  ends  at  nt  4150.  Some  of  the  features  are: 
the  WEE  26S  promoter  (nt  101-124);  the  start  of  the  26S  RNA  (nt  120);  the  polyprotein  start 
codon  (nt  159);  the  termination  codon  (nt  3867);  and  the  tandem  repeats  (nts  3896-3935, 3954- 
3993). 
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450  460  470  480 

441  AAACCAGGGAAACGAC  aLa]  CGTATGTGTATGAAGTTGGAGT 
441  AAACCAGGGAAACGACAGCGTATGTGTATGAAGTTGGAGT 
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481  CGGACAAGACATTTCCGATCATGCTGAACGGCCAAGTGAA 
481  CGGACAAGACATTTCCGATCATGCTGAACGGCCAAGTGAA 
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T  G  G  A  T  A0  GCTTGCGTTGTCGGAGGAAGGCTGATGAAACCA 
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561  CTCCACGTTGAAGGAAAAATTGATAATGAGCAATTAGCGG 
561  CTCCACGTTGAAGGAAAAA  T0  GATAATGAGCAATTAGCGG 
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610  620  630  640 
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601  CCGTGAAATTGAAGAAGGCTAGCATGTACGACTTGGAGTA 
601  CCGTGAAATTGAAGAAGGCTAGCATGTACGACTTGGAGTA 


• - 1 - 1  I  1“ 

650  660  670  680 

641  [cIggcgacgttccccaga  a[c)  ATGAAATCAGACACGCTGCAG 
641  TGGCGACGTTCCCCAGAATATGAAATCAGACACGCTGCAG 
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681 


T 


690  700  710  720 

tacaccagcgacaaaccaccgggcttLcJtacaactggcacc 

TACACCAGCGACAAACCACCGGGCTTTTACAACTGGCACC 
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761  gagaggagtgggcgggaaaggcgacagtggaagaccgatc 
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801  CTGGACAACAGAGGCAGAGTTGTGGCTATTGTTCTAGGAG 
801  CTGGACAACAGAGGCAGAGTTGTGGCTATTGTTCTAGGAG 

— - - 1 - 1 - r 

850  860  870  880 
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841  GTGCAAATGAGGGCACGCGTACGGCGCTTTCAGTGGTCAC 

841  gtgcaaaSgagggcacgcgtacggcgctttcagtggtcac 
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881  T  T  GGAACCAGAAAGGGGTGACCATTAGGGATACCCCCGAA  71V-1658seql 
881  TTGGAACCAGAAAGGGGTGACCA  tQ  aS  GGATACCCCCGAA  BFS-1703seq 


921  GGTTCTGAACCGTGGTCACTAGTTACAGCGCtL^TGCGTGC  71V-1658seql 
921  GGTTCTGAACCGTGGTCACTAGTTACAGCGCTGTGCGTGC  BFS-1703seq 


961  TTTCGAATGTCACGTTCCCATGCGACAAACCACCCGTGTG  71V-1658seql 
961  TTTCGAATGTCACGTTCCCATGCGACAAACCACCCGTGTG  BFS-1703seq 


1001  C  T  A  T  T  C  A  C  T  gLa|  C  G  C  CAGAACGAACACTCGACGTGCTCGAA  71V-1658seql 
1001  CTATTCACTGGCGCCAGAACGAACACTCGACGTGCTCGAA  BFS-1703seq 


1041  GAGAACGTCGACAATCCAAATTACGACACGCTGCTGGAGA  71V-1658seql 
1041  GAGAACGTCGACAATCCAAATTACGACACGCTGCTGGAGA  BFS-1703seq 


1081  ACGTCTTGAAATGTCCATCACGCCGGCCCAAACGAAGCAT  7lV-1658seql 
1081  ACGTCTTGAAATGTCCATCACGCCGGCCCAAACGAAGCAT  BFS-1703seq 


1101  tACCGATGACTTCACACTGACCAGTCCCTACCTGGGGTTC  71V-1658seql 
1101  tACCGATGACTTCACACTGACCAGTCCCTACCTGGGGTTC  BFS-1703seg 


1161  TGCCCGTATTGCAGACACTC  aIa|  CGCCGTGTTTCAGCCCAA  71V-1658seql 
1161  TGCCCGTATTGCAGACACTCAGCGCCGTGTTTCAGCCCAA  BFS-1703seq 


1201  TAAAAATTGAGAACGTGTGGGACGAATCTGATGATGGATC  71V-1658seql 
1201  TAAAAATTGAGAACGTGTGGGACGAATCTGATGATGGATC  BFS-1703seq 


1241  GATTAGAATCCAGGTCTCGGCACAATTCGGCTACAATCAG  7lV-1658seql 
1241  GATTAGAATCCAGGTCTCGGCACAATTCGGCTACAATCAG  BFS-1703seq 


1281  GCAGGCACTGCGGATGTCACCAAATTCCGTTACATGTCTT  71V-1658seql 
1281  GCAGGCACTGcHgATGTCACCAAATTCCGTTACATGTCTT  BFS-1703seq 
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1361  AATAGCTA  t1c|  AGCACATCTGGACCCTGCCGTCGTCTTGGC  71V~1658seql 
1361  AATAGCTATTAGCACATCTGGACCCTGCCGTCGTCTTGGC  BFS-1703seq 


— I — 

1410 


— I — 

1420 

_ I _ 


1430 

_ I _ 


1401  CACAAAGGGTACTTCCTGTTAGCT 
1401  CACAAAGGGTACTTCCTGTTAGCT 


CAATGTCCTCCAG 
CAATGTCCTCCAGGTG 


1440 

_ L_ 

G  T  G  7lV-1658seql 
BFS-1703seq 


1441 

1441 


I 

1450 


— I — 

1460 


— I — 

1470 

_ I _ 


1480 


ACAGTGTAACCGTCAGTATCACGAGCGGAGCATCTGAGAA  71V-1658seql 
ACAGTGTAACCGTCAGTATCACGAGCGGAGCATCTGAGAA  BFS-1703seq 


1490 

_ I _ 


1500 

_ I _ 


— I — 

1510 

_ I _ 


1520 


71V-I658seql 

BFS-1703seq 


1481  TTCATGCACCGTGGAGAAAAAGATCAGGAGGAAGTTTGTC 
1481  TTCATGCACCGTGGAGAAAAAGATCAGGAGGAAGTTTGTC 

- — — -T - 1 - 1 - r 

1530  1540  1550  1560 

1521  GGTAGAGAGGAGTACTTGTT  C  C  C  ACCIcJgTCCATGGAAAGC  71V-1658seql 
1521  GGTAGAGAGGAGTACTTGTTCCCACCTGTCCATGGAAAGC  BFS~1703seq 


1570 


1580 


1590 

_ I _ 


1600 


1561  TGGTAAAGTGCCACGTTTACGATCACTT 
1561  TGGTAAAGTGCCACGTTTACGATCACTT 


GAAGGAGACGTC 

GAAGGAGACGTC 


71V-1658seql 

BFS-1703seq 


1610 

_ 1 _ 


— I — 

1620 

_ I _ 


1630 

I 


1640 


1601  TGCCGGGTACATAACCATGCACAGGCCAGGCCCACACGCG 
1601  T  G  C  C  G  gH  TACATAACCATGCACAGGCCAGGCCCACACGCG 


71V-1658seql 

BFS-1703seq 


1641 

1641 


— I — 

1650 


— I — 

1660 


— I — 

1670 


1680 


TATAAGTCCTATCTGGAGGAAGCGTCAGGCGAAGTGTACA  71V-1658seql 
TATAAGTCCTATCTGGAGGAAGCGTCAGGCGAAGTGTACA  BFS-1703seq 


1681 

1681 


1690 

_ I _ 


1700 


— I — 

1710 

_ I _ 


TTAAACCACCTTCTGGCAAGAACGTCACCTACGAAT 
TTAAACCACCTTCTGGCAAGAACGTCACCTACGAATGTAA  BFS~1703seq 


1720 

G  T  A  A  71V-1658seql 


1721 

1721 


- T— - 1 - 1 - r 

1730  1740  1750  1760 

GTGTGGCGACTA  C  A  G  C  A  C  A  G  G  T  A  tIc]  GTGAGCACGCG  A  A  C  G  71V-1658seql 
GTGTGGCGACTACAGCACAGGTATTGTGAGCACGCGAACG  BFS~1703seq 
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1841  TCTTATTAGGCACACAGACCACTCAGTGCAAGGTAAATTG  7lV-1658segl 
1841  TCTTATTAGGCACACAGACCACTCAGTGCAAGGTAA  a(3  T  G  BFS-1703seq 


- 1 - 1 - 1  r 

1890  1900  1910  1920 

_ I - 1 - 1 - 1- 

1881  CACATTCCATTCCGCTTGACACCGACAGTCTGCCCGGTTC  7lV-1658seql 
1881  CACATTCCATTCCGCTTGACACCGACAGTCTGCCCGGTTC  BFS-1703seq 

I - 1  I  * 

1930  1940  1950  1960 

_ r _ -1— _ I - 1- 

1921  CGTTAGCTCACACGCCTACAGTCACGAAGTGGTTCAAAGG  71V-1658seql 
1921  CGTTAGCTCACACGCCTACAGTCACGAAGTGGTTCAAAGG  BFS-1703seq 

III* 

1970  1980  1990  2000 

_ I - 1 - 1 - L. 

1961  CATCACCCTCCACCTGACTGCAATGCGACCAACATTGCTG  71V-1658seql 
1961  CATCACCCTCCACCTGACTGCA  aS  GCGACCAACATTGCTG  BFS-1703seq 

- 1 - 1  I  r 

2010  2020  2030  2040 

_ I _ Z_ _ I _ I- 

2001  ACAACGAGAAAATTGGGGCTGCGAGCAGACGCAACAGCAG  71V-1658seql 
2001  ACAACGAGAAAATTGGGGCTGCGAGCAGACGCAACAGCAG  BFS-1703seq 

- 1 — — - 1 - 1 - r 

2050  2060  2070  2080 

i  I  »  * 

2041  AATGGATTACAGGGTCTACATCCAGGAATTTTTCTGTGGG  7lv-1658seql 
2041  AATGGATTACAGG  G@  CTACATCCAGGAATTTTTCTGTGGG  BFS-1703seq 

- — - 1— - 1 - 1 - r 

2090  2100  2110  2120 

_ I - 1 - 1 - 1- 

2081  GCGAGAAGGGCTGGAGTACGTATGGGGTAACCATGAACCA  7lV-1658seql 
2081  GCGAGAAGGGCTGGAGTACGTATGGGG0AACCATGAACCA  EFS-1703seq 

- 1 - 1 - — - -I - r 

2130  2140  2150  2160 

_ I _ I - 1 - L. 

2121  GTCAGAGTCTGGGCCCAGGAGTCGGCACCAGGCGACCCAC  7lV-1658seql 
2121  GTCAGAGTCTGGGCCCAGGAGTCGGCACCAGGCGACCCAC  BFS-1703seq 


2161  ATGGATGGCCGCATGAGATCATCATCCACTATTATCATCG  7lV-1658seql 
2161  ATGGATGGCCGCATGAGATCATCATCCACTATTATCATCG  BFS-1703seq 
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—  I  I 

2210  2220  2230  2240 

_ I _ I - 1 - L- 

2201  GCATCCAGTCTACACTGTCATTGTGCTGTGTGGTGTCGCT  71V-1658seql 
2201  GCATCCAGTCTACACTGTCATTGTGCTGTGTGGTGTCGCT  BFS-1703seq 

'  I  .  I"  . . . . . . .  i 

2250  2260  2270  2280 

2241  CTTGCTATCCTGGTAGGCACTGCATCL^TCAGCAGCTTGLdA  71V-1658seql 
2241  CTTGCTATCCTGGTAGGCACTGCATCGTCAGCAGCTTGTA  BFS-1703seq 

- 1 - 1 - 1 - 1- 

2290  2300  2310  2320 

_ I _ I - 1 - 1- 

2281  TCGCCAAAGCAAGAAGAGACTGCCTGACGCCATACGCGCT  71V-1658seql 
2281  TCGCCAAAGCAAGAAGAGACTGCCTGACGCCATACGCGCT  BFS-1703seq 

r  I  I  I 

2330  2340  2350  2360 

_ I _ I _ I _ 1- 

2321  TGCACCGAACGCAACGGTACCCACAGCATTAGCGGTTTTG  71V-1658seql 
2321  TGCACCGAACGCAACGGTACCCACAGCATTAG  C0  G  T  T  T  T  G  BFS-1703seq 

■ - 1 - 1 - 1 - r 

2370  2380  2390  2400  ' 

2361  T  G  C  T  Gld  ATTCGGCCAACCAACGCTGAAACATTTGGAGAAA  71V-1658seql 
2361  TGCTGTATTCGGCCAACCAACGCTGAAACATTTGGAGAAA  BFS-1703seq 

- 1 - 1 - 1 - r 

2410  2420  2430  2440 

_ I _ I - 1 - 1- 

2401  CTTTGAACCATCTGTGGTTTAACAACCAACCGTTTCTCTG  71V-1658seql 
2401  CTTTGAACCATCTGTGGTTTAACAACCAACCGTTTCTCTG  BFS-1703seq 

■  III 

2450  2460  2470  2480 

_ I _ I _ I _ l_ 

2441  GGCACAGTTGTGCATTCCTCTGGCAGCGCTTGTTATTCTG  71V-1658segl 
2441  GGCACAGTTGTGCA  tE]  CCTCTGGCAGCGCTTGTTATTCTG  BFS-1703seq 

. I  I  I  I 

2490  2500  2510  2520 

_ I _ I - 1 - L. 

2481  TTCCGCTGCTTTTCATGCTGCATGCCTTTTTTATTGGTTG  7lV-1658seql 
2481  TTCCGCTGCTTTTCATGCTGCATGCCTTTTTTATTGGTTG  BFS-1703seq 

- 1 - 1 - 1 - r 

2530  2540  2550  2560 

_ I _ I _ I - 1- 

2521  CAGGCGTCTGCCTGGGGAAGGTAGACGCCTTCGAACATGC  71V-1658seql 
2521  CAGGCGTCTGCCTGGGGAAGGTAGACGCCTTCGAACATGC  BFS-1703seg 

lilt 

2570  2580  2590  2600 

_ I _ I _ I _ I- 

2561  GACCACTGTGCCAAATGTTCCGGGGATCCCGTATAAGGCG  71V-1658seql 
2561  GACCACTGTGCCAAATGTTCCGGGGATCCCGTATAAGGCG  BFS-1703seq 

- ^ - , - , - r- 

2610  2620  2630  2640 

2601  TTGGTCGAAC  gIcJg  CAGGTTACGCGCCACTTA  A|cjc  T  G  G  A  G  A  7lV-1658seql 
2601  TTGGTCGAACGTGCAGGTTACGCGCCACTTAATCTGGAGA  BFS-1703seq 
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-  "  I  I  '  .  I  '  . ..—11^- - 

2650  2660  2670  2680 

- - - 1 - 1 - 1 - 1- 

2641  T  C  A  cLg|  GTCGTCTCATCGGAATTAACACCTTCAACTAACAA  71V-1658seql 
2641  TCACTGTCGTCTCATCGGAATTAACAC  C0  TCAACTAACAA  BFS-1703seq 

- 1 - 1 - 1 - r 

2690  2700  2710  2720 

- 1 - J-. - JL_ - 1 - 1- 

2681  GGAGTACGTGACCTGC  AL^  ATTCCACACAGTCATTCCTTCA  71V-1658seql 
2681  GGAGTACGTGACCTGCAGATTCCACACAGTCATTCCTTCA  BFS-1703seq 

- 1 - 1 - 1 - r 

2730  2740  2750  2760 

2721  CCACAAGTTAAATGCTGCGGGTCCCTCGAGTGldAAGGCAT  7lV-1658seql 
2721  CCACAAGTTAAATGCTGCGGGTCCCTCGAGTGTAAGGCAT  BFS-1703seg 

- 1 - 1 - 1 - r 

2770  2780  2790  2800 

_ I _ I _ I _ L. 

2761  CCTCAAAGGCGGATTACACATGCCGCGTTTTTGGCGGTGT  7lV-1658seql 
2761  C  C  T  C  A  A  aS  GCGGATTACACATGCCGCGTTTTTGGCGGTGT  BFS-1703seq 

- 1 - 1 - 1 - r 

2810  2820  2830  2840 

2801  GTACCCTTTCATGTGGGGAGGCGCAC  aIaI  TGCTTCTGTGAC  71V-1658seql 
2801  GTACCCTTTCATGTGGGGAGGCGCACAGTGCTTCTGTGAC  BFS-1703seq 

- 1 - 1 - 1 - r 

2850  2860  2870  2880 

_ I _ j_ _ I _ l_ 

2841  AGTGAGAACACACAACTGAGTGAGGCGTACGTCGAGTTCG  71V-1658seql 
2841  AGTGAGAACACACAACTGAGTGAGGcBtACGTCGAGTTCG  BFS-1703seq 

- 1 - 1 - 1 - r 

2890  2900  2910  2920 

2881  CTCCAGACTGCACTA  t1a|g  A  T  C  a1c|g  CAGTCGCACTAAAAGT  71V-1658seql 
2881  CTCCAGACTGCACTATTGATCATGCAGTCGCACTAAAAGT  BFS-1703seq 

- 1 - 1 - 1 - r 

2930  2940  2950  2960 

_ I _ 1 _ 1 _ l_ 

2921  TCACACAGCTGCTCTGAAAGTCGGCCTGCGTATAGTATAC  7lV-1658seql 
2921  TCACACAGCTGCTCTGAAAGTCGGCCTGCGTATAGTATAC  BFS-1703seq 

- — - 1 - 1 - 1 - 1- 

2970  2980  2990  3000 

2961  G  G  C  A  ALd  ACCACCGCGCACCTGGATACGTTTGTCAATGGCG  71V-1658seql 
2961  GGCAATACCACCGCGCACCTGGATACGT  T0  GTCAATGGCG  BFS-1703seq 

- 1 - 1 - 1 - r 

3010  3020  3030  3040 

_ I _ I _ I _ L. 

3001  TCACGCCAGGTTCCTCACGGGACCTGAAGGTCATAGCAGG  71V-1658seql 
3001  TCACGCCAGGTTCCTCACGGGACCTGAAGGTCATAGCAGG  BFS-1703seq 

- 1 - 1 - 1 - r 

3050  3060  3070  3080 


3041  GCCGATATCAGCCGCTTTTTCACCCTTTGACCATAAGGTC  71V-1658segl 
3041  GCCGATATCAGCCGCTTTTTCACCCTTTGACCATAAGGTC  BFS-1703seq 
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3081  GTCATCAGAAAGGGGCTTGTTTACAACTACGACTTCCCTG  71V-1658seql 
3081  GTCATCAGAAAGGGGCTTGTTTACAACTACGACTTCCCTG  BFS-1703seq 


3121  AGTATGGAGCTATGAAACCAGGAGCGTTCGGCGATATTCA  71V-1658seql 
3121  a[a]  TATGGAGCTATGAAACCAGGAGCGTTCGGCGATATTCA  BFS-1703seq 


3161  AGCATCCTCGCTTGATGCTACAGACATAGTAGCCCGCACT  71V-1658seql 
3161  AGCATCCTCGCTTGATG  cS  ACAGACATAGTAGCCCGCACT  BFS-1703seq 


3201  GACATACGGCTGCTGAAGCCTTCTGTCAAGAACATCCACG  71V-1658seql 
3201  GACATACGGCTGCTGAAGCCTTCTGTCAAGAACATCCACG  BFS-1703seq 


3241  TCCCCTACACCCAAGCAGTATCAGGGTATGAAATGTGGAA  71V-1658seql 
3241  TCCCCTACACCCAAGCAGTATCAGGGTATGAAATGTGGAA  BFS-1703seq 


3281  GAACAACTCAGGACGACCCCTGCAAGAAACAGCACCATTT  71V-1658seql 
3281  GAACAACTCAGGACGACCCCTGCAAGAAACAGCACCATTT  BFS-1703seq 


3321  GGATGTAAAATTGAAGTGGAGCCTCTGCGAGCGTCTAACT  7lV-1658seql 
3321  GGATGTAAAATTGAAGTGGAGCCTCTGCGAGCGTCTAACT  BFS-1703seg 


3361  GTGCTTACGGGCACATCCCTATCTCGATTGACATCCCTGA  71V-1658seql 
3361  GTGCTTACGGGCACATCCCTATCTCGATTGACATCCCTGA  BFS-1703seq 


3401  TGCAGCTTTTGTGAGATCATCAGAATCACCAACAATTTTA  71V-1658seql 
3401  [JgCAGCTTTTGTGAGATCATCAGAATCACCAACAATTTTA  BFS-1703seq 


3441  GAAGTTAGCTGCACAGTAGCAGACTGCATTTATTCTGCAG  71V-1658seql 
3441  GAAGTTAGCTGCACAGTAGCAGACTGCATTTATTCTGCAG  BFS-1703seq 


3481  ACTTTGGTGGTTCTCTAACATTACAGTACAAAGCTGACAG  71V-1658seql 
3481  ACTTTGGTGGTT  C0  CTAACATTACAGTACAAAGCTGACAG  BFS-1703seq 
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- - - 1 - 1 - 1 - 

3530  3540  3550  3560 

_ I _ I _ I _ I- 

3521  GGAGGGACATTGTCCAGTTCACTCCCACTCCACGACAGCT  71V-1658seql 
3521  GGAGGGACATTGTCCAGTTCACTCCCACTCCAcSaCAGCT  BFS-1703seq 

- 1^- - 1 - 1 - r 

3570  3580  3590  3600 

_ I _ I _ I _ L. 

3561  GTTTTGAAGGAAGCGACCACACATGTGACTGCCGTAGGCA  7lV-1658seql 
3561  GrTTTTGAAGGAAGCGACCACACATGTGACTGCCGTAGGCA  BFS-1703seq 

- 1 - 1 - 1  T 

3610  3620  3630  3640 

_ u - 1 - 1 - 1- 

3601  GCATAACACTACATTTTAGCACATCGAGCCCACAAGCAAA  7lV-1658seql 
3601  GCATAACACTACATTTTAGCACATCGAGCCCACAAGCAAA  BFS~1703seq 

- ^ - 1 - 1 - r- 

3650  3660  3670  3680 

_ l - 1 - 1 - L. 

3641  TTTTATAGTTTCGCTATGCGGCAAGAAGTCCACCTGCAAT  71V-1658seql 
3641  TTTTATAGTTTCGCTATGCGGCAAGAA  gH  CCACCTGCAAT  BFS-1703seg 

I  I  I  I 

3690  3700  3710  3720 

_ I _ I _ I _ L 

3681  GCTGAATGTAAACCACCGGCCGACCACATAATTGGAGAAC  7lV-1658seql 
3681  GCTGAATGTAAACCACCGGCCGACCACATAATTGGAGAAC  BFS-1703seq 

- 1 - 1 - 1 - r 

3730  3740  3750  3760 

_ I _ I _ I _ l_ 

3721  CACATAAAGTCGACCAAGAATTCCAGGCGGCAGTTTCCAA  71V-1658seql 
3721  CACATAAAGTCGACCAAGAATTCCAGGCGGCAGTTTCCAA  BFS-1703seg 

I  I  I  ^ 

3770  3780  3790  3800 

- 1 - - - -i- - 1 - 1- 

3761  AACATCTTGGA  a1c|  TGGCTGCTTGCACTGTTTGGGGGAGCA  7lV-1658seql 
3761  AACATCTTGGAATTGGCTGCTTGCACTGTTTGGGGGAGCA  BFS-1703seq 

- 1 - 1 - 1 - r 

3810  3820  3830  3840 

_ I _ I - 1 - L- 

3801  TCATCCCTCATTGTTGTAGGACTTATAGTGTTGGTCTGCA  71V-1658seql 
3801  TCATCCCTCATTGTTGTAGGACTTATAGTGTTGGTCTGCA  BFS-1703seq 

. .  1"  - - 1 - i - r 

3850  3860  3870  3880 

_ I _ j _ I _ l_ 

3841  GCTCTATGCTTATAAACACACGTAGATGACTGAGCGCGGA  71V-1658seql 
3841  GCTCTATGCTTATAAACACACGTAGATGACTGAGCGCGGA  BFS-1703seq 

- - , - , - 1 - 

3890  3900  3910  3920 

_ I _ I _ I _ l_ 

3881  CACTGACATAGCGGTAAAACTCGATGTACTTCCGAGGAAG  7lV-1658seql 
3881  CACTGACATAGCGGTAAAACTCGATGTACTTCCGAGGAAG  BFS-1703seq 

- 1 - 1 - 1 - r 

3930  3940  3950  3960 

_ I _ I - 1 - L. 

3921  CGTGGTGCATAATGCCACGCGCCGCTTGACACTAAAACTC  71V-1658seql 
3921  CGTGGTGCATA  aQ  GCCACGCGCCGCTTGACACTAAAACTC  BFS-1703seg 
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- 1 - , - - - 1 - 1 - 1— 

3970  3980  3990  4000 

_ I - 1 - 1 - >- 

3961  GATGTATTTCCGAGGAAGCACAGTGCATAATGCTGTGCAG  71V-1658seql 
3961  GATGTATTTCCGAGGAAGCACAGTGCATAATGCTGTGCAG  BFS-1703seq 

- 1 - 1 - 1 - 1" 

4010  4020  4030  4040 

_ I - 1 - 1 - 1- 

4001  TGTCACATTAATCGTATATCACACTACATATTAACAACAC  71V-1658seql 
4001  TGTCACATTAATCGTATATCACACTACATATTAACAACAC  BFS-1703seq 

- 1 - , - , - 

4050  4060  4070  4080 

_ ^ - 1 - 1 - 1- 

4041  TATATCACTTTTATGAGACTCACTATGGGTCTCTAATATA  7lV-1658seql 
4041  TATATCACTTTTATGAGACTCACTATGGGTCTCTAATATA  BFS-1703seq 


—I - 1 - 1 - 1" 

4090  4100  4110  4120 


4081  CACTACACATATTTTACTTAAAAACACTATACACACTTTA  BFS-1703seq 


4121 

4121 


T 


T 


4130  4140  4150  4160 

T  A  A  A  T  T  G  t|  c|  t|c|ataatttIcaIctttI - - - 

TAAATTCTTTTATAATTTTTCTTTTGTTTTTATTTTGTTT 


71V-1658segl 

BFS-1703seq 


- r 

4170 

I 

4145  I - 1  A  G  G  T  T  T 

4161  T  T  A  aIa  aI  T  T  T0 


Decoration  'Decoration  #1' :  Box  residues  that  differ  from  the  Consensus. 


71V-1658segl 

BFS-1703seq 


UNCLASSIFIED 


B1 


Appendix  B  Alignment  of  the  amino  acid  sequences  of  the  26S  region  of  WEE  strain  71V- 
1658  and  BFS-1703. 


UNCLASSIFIED 
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MFPYPQLNFPPVYPTNPMAYRDPNPPRRRWRPFRPPLAAQ  71V-1658prt 
MFPYPQLNFPPVYPTNPMAYRDPNPP  R0  RWRPFRPPLAAQ  BFS-1703prt 


I  E  D  L  R  R  S  iLa  NLTFKQRSPNPPPGPPPKKKKSAPKPKPTQP  71V-1658prt 
lEDLRRSIANLTFKQRSPNPPPGPPPKKKKSAPKPKPTQP  BFS-1703prt 


K  K  K  K  Q  Q  A  k[r1  T  KRKPKPGKRQRMCMKLESDKTFPIMLNGQV  71V-1658prt 
KKKKQQAKKTKRKPKPGKRQRMCMKLESDKTFPIMLNGQV  BFS-1703prt 


NGYACVVGGRLMKPLHVEGKIDNEQLAAVKLKKASMYDLE  71V-1658prt 
NGYACVVGGRLMKPLHVEGKIDNEQLAAVKLKKASMYDLE  BFS-1703prt 


161  YGDVPQNMKSDTLQYTSDKPPGFYNWHHGAVQYENGRFTV  7lV-1658prt 
161  YGDVPQNMKSDTLQYTSDKPPGFYNWHHGAVQYENGRFTV  EFS-1703prt 


201  PRGVGGKGDSGRPILDNRGRVVAIVLGGANEGTRTALSVV  71V-1658prt 
201  PRGVGGKGDSGRPILDNRGRVVAIVLGGANEGTRTALSVV  BFS-1703prt 


241  TWNQKGVT  DTPEGSEPWSLVTALCVLSNVTFPCDKPPV  7lV-1658prt 
241  TWNQKGVTIKDTPEGSEPWSLVTALCVLSNVTFPCDKPPV  BFS-1703prt 


281  C  Y  S  L0  PERTLDVLEENVDNPNYDTLLENVLKCPSRRPKRS  71V-1658prt 
281  CYSLAPERTLDVLEENVDNPNYDTLLENVLKCPSRRPKRS  BFS-1703prt 


321  ITDDFTLTSPYLGFCPYCRHSl^PCFSPIKIENVWDESDDG  71V-1658prt 
321  ITDDFTLTSPYLGFCPYCRHSAPCFSPIKIENVWDESDDG  BFS-1703prt 


361  SIRIQVSAQFGYNQAGTADVTKFRYMSFDHDHDIKEDS  MIS  71V-1658prt 
361  SIRIQVSAQFGYNQAGTADVTKFRYMSFDHDHDIKEDSMD  BFS-1703prt 


401  KIAISTSGPCRRLGHKGYFLLAQCPPGDSVTVSITSGASE  7lV-1658prt 
401  KIAISTSGPCRRLGHKGYFLLAQCPPGDSVTVSITSGASE  BFS-1703prt 
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T" 

450 

_ I _ 


460 

_ L— 


470 

-J _ 


“T" 

480 


441 

441 


481 

481 


521 

521 


561 

561 


601 

601 


NSCTVEKKIRRKFVGREEYLFPPVHGKLVKCHVYDHLKET 

NSCTVEKKIRRKFVGREEYLFPPVHGKLVKCHVYDHLKET 


— I— 

490 


I 

500 


510 


— P- 

520 


SAGYITMHRPGPHAYKSYLEEASGEVYI 

SAGYITMHRPGPHAYKSYLEEASGEVYI 


KPPSGKNVTYEC 

KPPSGKNVTYEC 


— I 

530 


I 

540 


550 

_ I _ 


560 


KCGDYSTGIVSTRTKMNGCTKAKQCIAYKSDQTKWVFNSP 

KCGDYSTGIVSTRTKMNGCTKAKQCIAYKSDQTKWVFNSP 


— I — 

570 


— r~ 

580 

_ I _ 


— P- 

590 


— IP 

600 


DLIRHTDHSVQGKLHIPFRLTPTVCPVPLAHTPTVTKWFK 

DLIRHTDHSVQGKLHIPFRLTPTVCPVPLAHTPTVTKWFK 


— I — 

610 


I 

620 


630 


— r 

640 


G  I  T  L  H  L  T  A[i^  RPTLLTTRKLGLRADATAE  WITGSTSRNFSV 
gitlhltatrptllttrklglradataewitg@tsrnfsv 


641 

641 


681 

681 


721 

721 


761 

761 


801 

801 


841 

841 


— r~ 

650 


i 

660 


I — 

670 

_ I _ 


680 

_ L- 


GREGLEYVWGNHEPVRVWAQESAPGDPHGWPHEI  I  IHYYH 
GREGLEYVWGNHEPVRVWAQESAPGDPHGWPHEIIIHYYH 


— r~ 

690 

_ I _ 


— I — 

700 


— r“ 

710 


— P 

720 

_ I- 


RHPVYTVIVLCGVALAILVGTASSAACIAKARRDCLTPYA 

RHPVYTVIVLCGVALAILVGTASSAACIAKARRDCLTPYA 


730 


740 


750 


— P 

760 


LAPNATVPTALAVLCCIRPTNAETFGETLNHLWFNNQPFL 

LAPNATVPTALAVLCCIRPTNAETFGETLNHLWFNNQPFL 


— P_ 

770 


— \ — 

780 


— I— 

790 


— P 

800 


WAQLCIPLAALVILFRCFSCCMPFLLVAGVCLGKVDAFEH 

WAQLCIPLAALVILFRCFSCCMPFLLVAGVCLGKVDAFEH 


— P_ 

810 


— 

820 


830 


— P 

840 


ATTVPNVPG 

ATTVPNVPG 


IPYKALVERAGYAPLNLE 

IPYKALVERAGYAPLNLE 


ITVVSSELTPSTN 

ITVVSSELTPSTN 


— P_ 

860 

_ { _ 


— P 

880 

_ I- 


850 


I 

870 


KEYVTCKFHTVIPSPQVKCCGSLECKASSKADYTCRVFGG 
K  E  Y  V  T  cSf  HTVIPSPQVKCCGSLECKASSKADYTCRVFGG 


71V-1658prt 

BFS-1703prt 


71V-165^rt 

BFS“1703prt 


71V-1658prt 

BFS-1703prt 


7lV-165^rt 

BFS-1703prt 


71V-1658prt 

BFS-1703prt 


71V-I65^rt 

BFS“1703prt 


71V-1658prt 

BFS-1703prt 


71V-I65^rt 

BFS-1703prt 


71V-I658prt 

BFS-1703prt 


71V-I658prt 

BFS-1703prt 


71V-I658prt 

BFS-1703prt 
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- 1 - 1 - i - 1 - r 

890  900  910  920 

_ I - 1 - 1 - 1- 

881  VYPFMWGGAQCFCDSENTQLSEAYVEFAPDCTIDHAVALK  71V-1658pi± 

881  VYPFMWGGAQCFCDSENTQLSEAYVEFAPDCTIDHAVALK  BFS-1703prt 

- 1 - 1 - 1 - r 

930  940  950  960 

_ I _ I - 1 - 1- 

921  VHTAALKVGLRIVYGNTTAHLDTFVNGVTPGSSRDLKVIA  71V-1658prt 

921  VHTAALKVGLRIVYGNTTAHLDTFVNGVTPGSSRDLKVIA  BFS-1703prt 

- 1 - 1 - 1 - r 

970  980  990  1000 

_ I _ I _ I _ L. 

961  GPISAAFSPFDHKVVIRKGLVYNYDFPEYGAMKPGAFGDI  7lV-1658prt 

961  GPISAAFSPFDHKVVIRKGLVYNYDFPEYGAMKPGAFGDI  BFS-1703prt 


1010  1020  1030  1040 

_ I _ I _ I _ l_ 

1001  QASSLDATDIVARTDIRLLKPSVKNIHVPYTQAVSGYEMW  71V-1658prt 
1001  QASSLDATDIVARTDIRLLKPSVKNIHVPYTQAVSGYEMW  HFS-1703prt 

I  I  I  I 

1050  1060  1070  1080 

_ I _ I - 1 - 1- 

1041  KNNSGRPLQETAPFGCKIEVEPLRASNCAYGHIPISIDIP  71V-1658prt 
1041  KNNSGRPLQETAPFGCKIEVEPLRASNCAYGHIPISIDIP  BFS-1703prt 

- 1 - 1 - 1 - r 

1090  1100  1110  1120 

I  I  I  I 

1081  DAAFVRSSESPTILEVSCTVADCIYSADFGGSLTLQYKAD  7lV-1658prt 
1081  DAAFVRSSESPTILEVSCTVADCIYSADFGGSLTLQYKAD  BFS-1703prt 

- — — - 1 - 1 - 1 - r 

1130  1140  1150  1160 

_ I _ I - 1 - 1- 

1121  REGHCPVHSHSTTAVLKEATTHVTAVGSITLHFSTSSPQA  71V-1658prt 
1121  REGHCPVHSHSTTAVLKEATTHVTAVGSITLHFSTSSPQA  BFS-1703prt 

■  III 

1170  1180  1190  1200 

_ 1 _ I _ I - 1- 

1161  NFIVSLCGKKSTCNAECKPPADHIIGEPHKVDQEFQAAVS  7lV-1658prt 
1161  NFIVSLCGK  k[t1  TCNAECKPPADHIIGEPHKVDQEFQAAVS  BFS-1703prt 

— - 1 - 1 - 1 - 

1210  1220  1230 

_ I _ I - 1 - - 

1201  KTSWNWLLALFGGASSLIVVGLIVLVCSSMLINTRR.  71V-1658prt 

1201  KTSWNWLLALFGGASSLIVVGLIVLVCSSMLINTR  rU  BFS-1703prt 

Decoration  ‘Decoration  #1':  Box  residues  that  differ  frcm  the  Consensus. 
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Appendix  C  a)  Sequence  of  the  5'  end  of  WNSTA-3  (WEE  71V-1658)  aligned  with  the  5' 
ends  of  HJ  (HJVOltnmc),  EEE  NA  [EEENA5'(WEAV)]  and  EEE  SA  (EEESAOl);  and 
b)  Sequence  of  the  3'  end  of  WNSTA-3  (WEE  7 IV- 1658)  aligned  with  the  5'  end  of  WEE  strain 
BPS  1703  (Weaver,  et  al,  1993)  as  described  in  text. 
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1  IatagggtIgItggtgtagag  gI  aIaIcIcItIa  c  c  c  t  a  cIa  aIa  c  t  a  aItIcI  WNgiA3-Mi3Ry 

I  - - - -  - - -|  —  I - HJVOltrunc 

1  IaTAGGGTATGGTGTAGAGGCAACCACCCT  aIt  T  - ^  EEENA5 '  (WEAV) 

1  ATAGGGTATGGTGTAGAGGCaIgIcCACC  cI^A  aclT^^-  -Pic  cl  EEESAOl 


41  gIa  TCCAaItIaTGG  aI  aIaI  gH  aFt  TCACGTTGACTTAGaI  T  Ig  C  T  G  aI  WNSIA3-Ml3Ry 

1  - - - -  —  HJVOltrunc 

39  ItATCCAAAATGGAGAAAGTTC  A[iJg  TTGACTTAGACG  cL^G  a  EEENA5 '  (WEAV) 
38  TATCCAAAATGGAGAAAG  Tf^C  A  C  G  tF^G  ACTTAGACGCTGaI  EEESAOl 


Alignment  Report  of  Untitled,  using  Clustal  method  with  Weighted  residue  weight  table. 
Monday,  October  6, 1997  12:00  PM 


1  G  Ay  CCACCATTTCACGAGTTTGCGTACGAGAGTCTCAAGA  vnsta3t7-rc 
1  GATCCACCATTTCACGAGTTTGCGTACGAGAGTCTCAAGA  WEQ01065TRC 


41  C  C  G  GACCAGCAGCACCTC  AlN  Nl  AAAGTCCCAACCATCGGAG  vmsta3t7-rc 
41  cIa  d  GACCAGCAGCACCTCACAA  aQ  GTCCCAACCATCGGAG  WED01065TRC 


81  TCTATGGAGTGCCAGGTTCAGGTAAATCTGGAAT  CIN  NIC  A  A  vjnsta3t7-rc 
80  TCTATGGAGTGCCAGGTTCAGGTAAATCTGGAATCATCAA  WEU01065TRC 


121  AAGCGCTGTGACTAAGAAAGATCTGGTTGTGAGTGCGAAG  vznsta3t7-rc 
120  AAGCGCTGTGACTAAGAAAGATCTGGTTGTGAGTGCGAAG  WEQOlOeSTRC 


161  AAGGAAAACTGCGCAGAAAT[c|ATCAGGGATGTAAGGAGGA  wnsta3t7-rc 
160  AAGGAAAACTGCGCAGAAATTATCAGGGATGTAAGGAGGA  WEU01065TRC 


201  TGAGACGTATGGATGTTGCTGCTAGGACTGTTGATTCAGT  wnsta3t7-rc 
200  TGAGACGTATGGATGTTGCTGCTAGGACTGTTGATTCAGT  WEa01065TRC 


241  G  C  T  T  C  T  A^ _ _ _  wnsta3t7-rc 

240  G  C  T  T  C  T  aIa  ATGGGGTTAAGCACCCCGTTAACACCCTGTAcI  WEU01065TRC 


wrista3t7-rc 
WEU01065'IIRC 

Decoration  'Decoration  #1' :  Box  residues  that  differ  frcm  the  Consensus. 
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